MAY 1 9 2005 a 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



)PECDFDCATIOM 



Docket No. 0635MH-40874 



TO ALL WHOM IT MAY CONCERN: 

BE IT KNOWN that we, Frederick S. M. Herz, David C. Parks, and Sampath 
Kannan, residing in the state of Pennsylvania, Paul Labys, residing in the state of Utah, and 
Jason Eisner, residing in the state of New York, have invented new and useful 
improvements in a 

SECURE DATA INTERCHANGE 
of which the following is a specification: 



Page 1 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 

1 CROSS REFERENCE TO RELATED APPLICATIONS 

2 The present application claims the benefit of United States Provisional 

3 Application No. 60/161,640, filed October 29, 1999, titled SECURE DATA 

4 INTERCHANGE, and Provisional Application No. 60/206,538, filed May 23, 2000, 

5 titled SECURE DATA INTERCHANGE, both of which are hereby incorporated by 

6 reference. 

BACKGROUND OF THE INVENTION 

7 1 . Field of the Invention: 

8 The Secure Data Interchange invention describes a system to allow a privacy- 

9 protected market for data exchange between multiple self-interested parties. The system 

10 presents a general infrastructure for the exchange of information within a safe privacy- 

1 1 protected environment, between multiple self-interested parties. We propose a central data 

12 warehouse that maintains data submitted by different users, and executes queries and 

1 3 programs on the data. Rules are associated with data that define how the data can be used 

14 and queried, to allow agents that submit data to maintain absolute control over its use. SDI 

1 5 acts as a trusted-intermediary to all parties, and implements an internal market for queries on 

16 the information, allowing agents to specify prices for data access. Furthermore, SDI 

1 7 supports complex queries such as collaborative filtering, that can provide a querying agent 

1 8 with a one-time benefit of data access but without long-term access to the data that was used 

1 9 to compute valuable results. 

20 The invention relates to systems that provide personalized information, profiling, 

21 automated matchmaking and information exchange, providing a framework that protects 

22 privacy and allows information collection and profiling within a carefully controlled 

23 environment. Although the marginal cost of data duplication is small, there are hidden costs 

24 associated with data, for example because of privacy concerns, and data can be a valuable 

25 resource in many problems. In business-to-consumer (B2C) applications Secure Data 

26 Interchange addresses the direct conflict between the goal of personalization and the need 

27 for privacy, preventing the exchange and collection of information without knowledge and 

28 consent. In business-to-business e-commerce applications (B2B) SDI allows vendors to 
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1 provide sensitive and valuable information, for example about business needs and customer 

2 bases, in a secure environment that controls access and leverages value. 

3 2. Description of the Prior Art: 

4 The invention of Secure Data Interchange relates to a wide-range of application 

5 domains, all of which are characterized by a need to exchange information in a privacy- 

6 protected and carefully controlled market-based environment. 

7 As a key application we suggest a system for personalized information delivery in a 

8 networked environment, in which the SDI-proxy can fact as a local filter on information, 

9 based on what it knows about a user's preferences and methods for filtering pushed by the 

1 0 provider of content. The system allows collaborative filtering through information that is 

11 provided to the central data warehouse, but never released directly to other agents; 

12 collaborative filtering methods are computed in the central SDI data warehouse. Further 

13 motivation is provided with reference to some electronic commerce applications, that we 

14 describe in (A) business-to-consumer and (B) business-to-business e-commerce 

15 applications. 

1 6 In addition to applications within commerce, the system of Secure Data Interchanges 

17 is central to developing many other new products. Examples include the formation of "self- 

18 help" groups between a set of individuals with common interests, and applications to 

1 9 personal information delivery systems, e.g. for educational and informational purposes. 

20 A. Business-to-Consumer (B2C) Electronic Commerce. 

21 

22 The recent explosion of electronic commerce, in particular Internet-based individual- 

23 to-business electronic commerce, presents new opportunities for automated personalized 

24 information delivery and the automated customization of products and services. This type of 

25 personalization is very valuable to vendors because it can increase sales volumes, enable 

26 cross-selling and up-selling of goods and services, and allows vendors to price products 

27 dynamically based on information about the preferences and goals of customers. 

28 Personalization is also useful to customers when it correctly identifies the requirements and 

29 preferences of a customer, because it can reduce search cost and enhance the "shopping 
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1 experience". Perhaps a customer can find the good or service (i.e. desirable 

2 price/quality/feature tradeoff) that he/she wants more quickly than without personalization, 

3 or receive information about an interesting new product or service that he/she did not know 

4 about. 

5 The basis for these new services is that Internet-based "shop fronts" can be 

6 individualized on a per-customer basis, dynamically and in real-time. Traditional main-street 

7 shops must offer the same store layout to every customer, because the layout is physical, 

8 although some level of personalized service can be achieved through well-trained sales 

9 assistants, that act as a "guide" for a customer within a store. On-line "shop fronts" are 

10 virtual, and configurable at negligible cost to the customer or the vendor, assuming that 

1 1 computation is cheap and fast. 

12 Furthermore, Internet-based electronic commerce can allow business to collect vast 

13 amounts of consumer information, because customers interact through a computer-based 

14 interface. Customers can be monitored as they browse a Web site for products and services. 

15 Information such as the search-terms that users enter into a search engine, the links that 

16 users follow, and the length of time spent on each page, can all provide an insight into the 

1 7 current goal of a customer, i.e. the type of product that he/she wants. When combined across 

1 8 different sessions, and with similar information about the browsing and purchasing habits of 

19 other customers, the information can be folded into a long-term view of the preferences and 

20 needs of a customer. 

21 Moreover, new network connectivity enables different vendors to exchange profiles 

22 for common customers, either statically or dynamically, in order to build broad and detailed 

23 profiles across vendor domains. There exist many potentially powerful synergies between 

24 the data sets that are collected by different vendors, that can be leveraged to provide 

25 appropriate services and products to customers. When analyzed with the proper statistical 

26 tools these data sets can reveal fundamental patterns in the behavior of users, and enable a 

27 vendor to provide appropriate information to a user. Furthermore, access to user-profiles 

28 collected by other vendors can enable vendors to provide focused information delivery to 

29 first-time users, and also cross-market services with other appropriate vendors. 
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1 Providing user profile information within a carefully controlled environment can 

2 benefit vendors and users: 

3 o Vendors would find benefit in sharing data with other vendors; this would deepen 

4 their understanding of their customers' behaviors and preferences, especially if some 

5 customers were traceable across several data sets. 

6 o Users would benefit from sharing data with other users. This is already evident in 

7 the popularity of news groups and web discussion pages catering to individuals with 

8 shared interests. By learning what other people with similar tastes and preferences 

9 have discovered and enjoyed, a user can sidestep information overload in the search 

1 0 for personally satisfying information. 

1 1 o Vendors can benefit from receiving data about users. An obvious example would be 

12 in the use of collaborative filtering for the marketing of targeted promotions; rather 

13 than being deluged with coupons and advertisements that are of absolutely no 

14 interest, a user would benefit by being presented with advertising that is highly 

15 relevant. In the process, the vendor would increase advertising response rates, 

1 6 boosting overall efficiency. 

1 7 o Users can receive benefits from providing information to vendors. Personalization 

18 of content at vendors' web pages, and well-focused banner advertisements at other 

1 9 web sites that they visit. 

20 The problem is that a user wants controlled personalization, in the sense that it might 

21 not be desirable for information about every on-line transaction that a user performs, every 

22 on-line document that a user reads, and every web page that a user visits, and demographic 

23 information, to be available to every business that the user interacts with, in the virtual and 

24 physical world. 



25 A.1 Focused Banner Advertising/Content provision 

26 Internet-based media sites have followed preceding formats in generating revenue 

27 from advertising, with content to users often provided free-of-charge. The business model is 

28 similar to that in newspapers, magazines, and television, where circulation and 

29 audience/readership demographics are used to drive revenue. Electronic media presents new 
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1 opportunities for media-based business: for example multimedia techniques and 

2 interactivity, personalized delivery of information, and personalized targeting of advertising. 

3 The problem - as before, is to acquire and leverage information about the 

4 preferences and interests of a user, within a system that protects user privacy (i.e. controls 

5 the collection and exchange of information about users, and controls the use that is made of 

6 that information). A further problem is to extrapolate information from a large corpus of 

7 data about an individual user. 

8 A. 2 Mailing Lists As another example, suppose that business A requests a list of 

9 individuals that meet a particular criteria. Consumer B meets the criteria, but is only listed 

10 for business A if A also meets criteria specified by B, for example if A will provide 

1 1 information about new products and services that are interesting to B. In an application to 

12 the profiling of users on-line, the problem is that users want to receive the benefits of 

1 3 targeted products and advertisements, but want to avoid the abuse of profile information and 

14 control vendors' access to that information. 

15 B. Business-to-Business (B2B) Electronic Commerce 

16 The Internet provides businesses with network connectivity with other business, 

1 7 both competitors and partners. This connectivity allows businesses to exchange information 

1 8 about customers (dynamically or statically), in order to identify potential new customers, 

19 build better profiles for existing customers, and up-sell/cross-sell products and services in 

20 real-time. The problem with this exchange of information (that can include swaps, sells, and 

21 rental access) is that businesses need to (a) protect the privacy of their customers; (b) 

22 prevent information release to competitors, either directly or through third-parties. 

23 B.1 Privacy-Protected Identification of Synergies/Matches There are many 

24 scenarios where autonomous agents would like to be informed of matches under conditions 

25 of mutual consent, but without information leakage to any agent if any one of the agents 

26 declines the match. Consider two vendors, A and B, and suppose the vendors seek strategic 

27 partnerships with other vendors that have appropriate skills and goals. However, vendor A 

28 does not want to broadcast to all vendors its need for a business partner or a new alliance, 

29 instead vendor A wants to be introduced to another vendor with the right mix of capabilities; 
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1 similarly for vendor B. What is required is a system that only introduces vendor A to vendor 

2 B, and perhaps anonymously at first, if both vendors consent to the introduction. The 

3 problem is to provide information that enables matches, without allowing bad matches and 

4 abuse of information - i.e. within an environment of secure data interchange. 

5 B.2 Credential-based Introductions, Contracting and Messaging-systems. There 

6 are many situations where individual parties, for example individuals or businesses, require 

7 introductions to credentialed individuals and/or businesses, with the aim of building a new 

8 relationship or making a new contract. Consider for example business associations, where 

9 credentials about non-bankruptcy, and no previous attempts to defraud could be important. 

10 Consider social introductions, where individuals might be concerned about past criminal 

1 1 activities of new contacts. In the domain of automobiles, we could consider a system that 

12 identifies other automobiles in the physical location of a vehicle that have recently been 

1 3 involved in an accident. The problem is to manage certificates within a system where users 

14 can maintain multiple identities, and to protect the release of certificates without suitable 

1 5 provisions for terms-of-use and criteria for request. 
16 
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SUMMARY OF THE INVENTION 

1 The above problems are solved, and a technical advance achieved, by the system of 

2 Secure Data Interchange. The Secure Data Interchange system enables information about 

3 bilateral and multilateral interactions between multiple persistent parties to be exchanged 

4 and leveraged within an environment that uses a combination of techniques to control access 

5 to information, release of information, and matching of information back to parties. 

6 The system of Secure Data Interchange (SDI) provides a trusted server containing a 

7 large database of information that is owned by its providers. Each data record has an 

8 associated price rule, that controls access to data. The pricing model allows a data owner to 

9 specify a price for different types and amounts of information access, and whether the 

10 identity of the information owner is required, and the system of SDI computes a composite 

1 1 price for a query based on aggregated prices for a query over a number of different data 

12 owners, with an internal market that favors low priced data. The pricing model allows 

1 3 discounts based on certificates of a requesting agent, and as a special case implements the 

14 standard capability-based access control systems, where information is provided to users 

15 with appropriate permissions (i.e. with zero and infinite prices). In addition, the system of 

1 6 Secure Data Interchange allows data to be submitted with a level of random perturbation 

1 7 (noise), to provide added privacy protection, or alternatively allow an agent to specify in 

18 conditions under which additional noise should be added to data. A query is priced before 

19 execution, to allow an agent to decide whether or not to execute a query, and select between 

20 alternative types of queries. Binding price quotes are provided to querying agents, and 

2 1 queries can be scaled to meet a budget. 

22 Data owners can submit data to the central data warehouse with different degrees 

23 of identification, for example anonymously, pseudonymously, or with a true identity. For 

24 example, in the case of data that represents a user's profile information, for example 

25 information about the interests of a user, a user might prefer to use a number of different 

26 pseudonyms for different types of activities that he/she likes to engage in online. A user 

27 might maintain a number of different aliases within the database, for example to represent 

28 different types of things he/she likes to do which have little bearing on each other. 
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1 Various types of queries can be executed on the server, ranging from traditional SQL 

2 style queries to collaborative-filtering style queries. The inventions of SDI is not predicated 

3 on the type of queries supported, describing instead a general system to execute those 

4 queries within a secure data-controlled environment. We leave the algorithms that 

5 implements the semantics of a query undefined, but define the interface between that 

6 algorithm and the data in the database, which is protected with price rules. 

7 A query can perform a considerable amount of computation on records in the 

8 database before any information is provided in response to a query. We allow general 

9 purpose programs to run on the server, that might for example perform collaborative 

10 filtering or other data mining techniques before returning aggregate information, for 

1 1 example a new model of the profiles of users of a particular interest to a vendor. However, 

12 the total charge for a complete query session is computed as the sum payment charged by all 

1 3 record pointers that are used in computing the final response. 

14 Furthermore, queries can also take an action on behalf of a querying agent, for 

15 example asking another agent to provide more information about something. We allow 

1 6 queries to be priced, based on a model of "revenue collection", in which the SDI server sells 

1 7 the right to access to data on behalf of the agents that submit data into the database. We also 

18 propose to allow "persistent queries", which reside on the server for a fixed period of time 

1 9 and return a response to the querying agent whenever conditions are met in the data. 

20 Interesting variations of SDI place data in different distributed locations, and move 

21 the control of information access between a central SDI server and distributed client-side 



22 SDI proxy agents. This allows different tradeoffs between privacy and information sharing. 

23 It also has implications for bandwidth and computational requirements within SDI. One role 

24 of a client-side SDI data warehouse is to provide the same functionality as the central shared 

25 SDI database, but with processing only performed on information provided by that agent. 

26 This can allow greater privacy by allowing a user to retain absolute control over his/her data 

27 on his/her local machine without even releasing data to the shared database. 

28 In an application to personalized on-line interactions, we describe a client-side SDI 

29 proxy which manages a user's interactions with the on-line sites of vendors and also 
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1 manages a user's interactions with the central SDI data warehouse, i.e. providing profile 

2 information and controlling profile access. The client-side SDI proxy for an agent that 

3 represents an individual browsing the Internet can manage that user's profiles in interactions 

4 with other agents, for example representing vendors and content providers. The client-side 

5 SDI proxy can also handle decisions about what types of information to submit to the server, 

6 and manages query execution on behalf of the agent. The client-side SDI proxy agent can 

7 also push information about a user's on-line activities to the central SDI data- warehouse in 

8 real time. This enables a system of "time-of-purchase-competition" system, in which a user 

9 can request competitive counteroffers from other vendors before making a purchase. 



10 The system addresses the fundamental conflict that exists between rights of privacy 

1 1 and efficiency gains from better bilateral exchange of profile/preference information. SDI as 

12 applied to B2C e-commerce allows consumers to receive targeted information about 

1 3 products and services, but without the loss-of-privacy that can easily occur in the current on- 

14 line profiling "free-for-all". The cookie technology provided by Netscape to supported 

1 5 personalized sessions with a single vendor on-line has been used by advertising network 

16 providers such as DoubleClick to track users across multiple sites, often without either the 

1 7 consent or knowledge of that individual [New York Times, Feb 7, 2000]. 



1 8 In describing the system of secure data interchange we claim the following novel 

19 technical ideas: 

20 (a) Agents can associate price-rules with information that is placed in the central 

2 1 database, and retain absolute control and ownership over all uses of that data. 

22 As a special case of price rules, the system supports access based on certified 

23 properties of querying agents (with zero and infinite prices). The pricing 

24 model allows information providing agents to receive direct value for data, 

25 and allows agents that request access to information to receive a price before 

26 a query is executed, and make appropriate decisions about what type of 

27 queries to execute. 

28 (b) A number of novel techniques are proposed to allow data processing within 

29 the data warehouse without releasing too much information to an agent. 

30 General programs, for example collaborative filtering techniques, can be 
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1 executed in situ within the data warehouse, so that agents can receive the 

2 aggregate benefits of information, without receiving details about the 

3 information. We might also allow an agent to receive anonymous or 

4 randomized information, and control the amount of information that can be 

5 received from a particular record. 

6 (c) The system of SDI can act as a trusted intermediary between agents, 

7 notifying agents about information of a particular type, and for example 

8 sending messages between agents without breaking the identity of agents 

9 except by consent. A special type of query that we call a persistent query 

10 allows an agent to maintain a permanent "searching" presence in the central 

1 1 database, always on the lookout for useful information. 

12 (d) In a key variation we push a small version of the SDI data warehouse onto 

13 an user's client computer, where it acts as a proxy agent, and further proxies 

14 an agent's interaction with other agents in real time. The local SDI data 

1 5 warehouse, called the client-side SDI database, is a trusted party to the client, 

16 trusted to maintain information that it is valuable to the client and use that 

17 information is appropriate ways. The advantage of this method is that an 

1 8 individual never needs to release sensitive profile information, it is always 

19 held on its local computer — but can still receive the benefits of 

20 personalization. 

21 (e) We also suggest a client-side SDI proxy that can collect information about a 

22 user, for example within an Internet browsing application, and periodically 

23 push the collected information to the SDI data warehouse in a controlled 

24 way. The client-side SDI proxy can also be responsible for certain data 

25 certification functions, and can manage a user's interactions with other 

26 agents to protect its privacy in non-SDI mediated transactions. 

27 As an application to B2C e-commerce, the system of SDI allows client-side personalization 

28 instead of provider-side personalization. Instead of passing profile information to a provider 

29 and receiving personalized information in return, providers can provide personalization 

30 methods that are used interactively with local profile information about consumers to target 

31 products and services without receiving explicit information about a user's profile. In a 
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1 simple form, the vendor provides complete information about its services, and a method to 

2 display them to the user based on his/her local profile. When describing the application of 

3 SDI to electronic commerce we also describe methods to implement necessary ancillary 

4 systems that are essential to supporting full e-commerce functionality within an identity- 

5 protected system, such as systems for pseudonymous payments and physical mailing of 

6 products. 

7 Collaborative filtering based on profiling information from multiple users is 

8 supported within the central SDI data warehouse, but within a system of economic 

9 incentives, where users provide profile information in return for receiving payments from 

10 vendors for that information. This allows broad network- wide information to be used for 

1 1 profiling, in addition to deep vendor-specific information. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

1 The novel features believed characteristic of the invention are set forth in the 

2 appended claims. The invention itself however, as well as a preferred mode of use, further 

3 objects and advantages thereof, will best be understood by reference to the following 

4 detailed description of an illustrative embodiment when read in conjunction with the 

5 accompanying drawings, wherein: 

6 Figures 1 through 21 illustrate various parts and embodiments of the invention. 



Page 13 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



DESCRIPTION OF THE PREFERRED EMBODIMENT 

1 1. Introduction 

2 The invention of Secure Data Interchange (SDI) describes a general infrastructure 

3 for the exchange of information within a controlled environment. We propose a central data 

4 warehouse that maintains data submitted by different users, and executes queries and 

5 programs on the data. Rules are associated with data that define how the data can be used 

6 and queried. As such, the system of SDI prevents the exchange and collection of 

7 information without knowledge and consent. The system allows for payments to be received 

8 by the providers of information, in return for data access. In application, the invention 

9 enables new systems for the delivery of personalized information, profiling and automated 

10 matchmaking and information exchange, all within a framework that protects privacy and 

1 1 maintains data security. SDI supports the collection and exchange of information between, 

12 and relating to, autonomous (and possibly self-interested) agents within a distributed 

1 3 environment. 

14 1.1 Definitions. 

1 5 Agent. An agent in SDI is a any party that wants to sell or give away data to other parties, or 

16 buy or receive data, or in general both provide and receive data. Agents may represent any 

17 party with individual goals, autonomy of control, and a persistent identity. Examples in 

18 business-to-consumer e-commerce include business such as newspapers, book stores and 

19 travel companies, that wish to receive data about the profiles and buying habits of users so 

20 that they can personalize the information, products and services that are sold to users. We 

2 1 assume that agents are autonomous from the system of Secure Data Interchange, and follow 

22 actions consistent with their preferences, abilities and resources. Similarly, an on-line 

23 consumer is an agent that wishes to provide data about its preferences and buying habits to 

24 vendors and other consumers, in return for well targeted products and financial reward. 

25 Agent Computers: Agents are represented in the system of secure data interchange with 

26 dedicated computational resources, agent computers, with permanent memory, processing 
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1 power, and network connectivity. For example, an agent computer might be a consumer's 

2 home PC that will act as a client machine in interactions with the central SDI data server and 

3 the servers of on-line vendors. For a vendor, an agent computer might be the server 

4 computer that it uses to execute its on-line business. Agent computers might also be thin 

5 clients, such as mobile computing devices, handheld devices, cell phones. We push different 

6 amounts of data and functionality within SDI to agent computers and away from web centric 

7 devices depending on the nature of the computer and an agent's preferences. 

8 Certificates: Certificates are used within SDI to establish trust between different agents, 

9 and to help agents to reach useful agreements. 

10 Profile: Each agent can have one or more profiles, which the agent can assume in its 

1 1 interactions with other agents. Part of a profile is a user's identity, that provides a (possibly 

12 limited) method for another agent to identify the agent in the future when it assumes the 

13 same profile. We allow three types of agent identities: anonymous, pseudonymous, 

14 persistent pseudonymous, and true identity. An agent may assume a profile and an identity 

1 5 in any interaction with another agent. 

1 6 o Anonymous. To assume an anonymous identity agent A creates a one-time identifier 

17 that it uses in interactions with one other agent. The identifier may allow the other 

1 8 agent to respond zero or one times. 

19 o Pseudonymous. A pseudonymous identifier created by agent A may be used with 

20 more than one other agent, and allows agents to respond to agent A as many times 

21 as they like, but agent A can terminate the pseudonym at any time and separate 

22 from the identity. Agent A can optionally restrict the number of agents that may 

23 reply to the pseudonym, for example to the agents that it explicitly provides with the 

24 identifier. 

25 o Persistent pseudonymous. A persistent pseudonymous identifier is created by agent 

26 A for use with agents in set S, and provides the added condition that agent A 

27 promises to use the same pseudonym for all interactions with all agents in set S for 

28 all time into the future. Agent A can optionally restrict the number of agents that 
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1 may reply to the pseudonym, for example to the agents that it explicitly provides 

2 with the identifier. 

3 o True identity. If agent A interacts with another agent under its true identity then it 

4 has no method to prevent the agent or any other agent responding to agent A in the 

5 future. 

6 The identity that an agent assumes with a profile matters not only to the ability of an 

7 agent to control the agents that can send it messages, but also its ability to control the 

8 amount of information that can be exchanged about the agent in the open marketplace, out 

9 of the agent's control. A basic premise in the system of Secure Data Interchange is to keep 

10 control of data. We provide methods that allow an agent to release data, or performing 

1 1 processing on data, to one agent but prevent that agent from selling the data on to another 

1 2 agent with which agent A also interacts. For example, one key technique is for agent A to 

1 3 use a unique pseudonym with every agent that it interacts. This allows an agent (so long as it 

14 is careful not to release other identifying information) to release profile information to other 

15 agents without losing the value of that information, because there can be no secondary 

16 market in the agent's profile. If agent A provides information to agent B, then agent B 

1 7 cannot pass that information onto another agent C and have it still linked to agent A because 

1 8 agent C does not know the identity of agent A among the agents that it interacts with. 

19 An agent's profile contains any and all data that an agent might wish to exchange 

20 with another agent when it assumes a particular identity. However, just because the data is 

21 in the profile it does not mean that it is available to another agent. Possible information in 

22 the profile of an on-line consumer includes: transactions that it has performed with other 

23 agents; information that relates to its true identity (e.g. salary range or education level); 

24 information provided by the agent (such as its preferences for a particular type of product, 

25 etc.); and other information that has been compiled based on observing the behavior of the 

26 agent (e.g. physical location for a mobile user, such as a user in a vehicle, or trace of recent 

27 web pages visited for a user that is browsing the Internet.) 

28 SDI-Proxy: An SDI-proxy refers to the software that runs on top of an agent's computer 

29 device, and configures that device for Secure Data Interchange. The proxy intermediates 

30 interactions between pairs of agents, and also intermediates transactions between agents and 
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1 the SDI data server. For example, an on-line consumer might define a profile-management 

2 policy at the client-side SDI proxy that automatically configures the agent's profile and 

3 identity when as the agent interacts with other agents. The profile-management policy 

4 implements an appropriate policy to select the user's profile and identity on the basis of the 

5 information that is available about the other agent (for example from certificates). 

6 1 .2 System Architecture The basic architecture for SDI is a system of agent 

7 computers, connected via a network (Internet, wireless, or otherwise) to other agent 

8 computers, and with a central SDI shared data warehouse. In Figure 1 we illustrate the top- 

9 level architecture of Secure Data Interchange, the networked system of agent computers and 

10 a centralized server computer that acts as a repository for data, rules and code. This is called 

1 1 the SDI data warehouse. Each user is associated with an agent computer, and in general 

12 users can be individuals, groups of individuals, or companies. In its most general form, the 

1 3 system of Secure Data Interchange is for a system of multiple autonomous agent computers, 

14 involved in multilateral communication. We restrict the system description to bilateral 

15 communication between agents, without loss of generality because any multilateral 

16 (multicast or broadcast) can be implemented as a set of bilateral communications. In any 

17 bilateral communication there are two parties, the sender and the receiver (and the parties 

1 8 can dynamically change over a communication session). 

19 SDI proxy agents and profile management policies on agent computers mediate 

20 bilateral agent interactions. Agents define profile-management policies that are implemented 

21 on agent computers and determine appropriate profiles and identities for an agent in 

22 interactions. Every time an agent initiates a new interaction, new information is available 

23 about the agent, that can be compiled by the agent's own computer device, and also by the 

24 computer device of the agent with which it interacts. Careful profile and identity 

25 management provides an agent with absolute control over the ability of other agents to 

26 profile the agent and exchange information about the agent, for example the agent can use a 

27 unique pseudonymous identifier with every other agent. However, the core of the SDI 

28 invention is that we encourage agents to exchange information, by providing a secure 

29 central data interchange for that purpose. Agents can submit profile information, and other 

30 data, to the central data warehouse and make it available for particular types of data mining 

3 1 by other agents, and receive financial reward for providing data. Products and services can 
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1 also be targeted for agent A on that agent's computer, without the provider agent receiving 

2 information about agent A's profile. For example, vendor B might provide generic 

3 information about its products to agent A, and a personalization rule that it has constructed 

4 from data mining the central SDI data warehouse. The agent computer of agent A can then 

5 use its local and private profile information to decide what products to target to the agent, 

6 based on the instructions that it receives from agent B. 



7 The arrows indicate possible flows of information between the different elements 

8 within the system. We allow agents to communicate directly, via bilateral or multilateral 

9 communication, and also indirectly via the central data warehouse. Information can be 

1 0 submitted to the central data warehouse, for example new data and new rules, or queries that 

1 1 the data warehouse will execute. The data warehouse returns information to agents, for 



12 example the results of a query. The Secure Data Interchange invention is independent of the 

1 3 implementation details of the communication platform. We assume that the system of SDI is 

14 built on top of a secure communications platform, for example via a SSL-encrypted TCP/IP 

1 5 session in an application to the Internet. Furthermore, we draw on cryptographic techniques 

16 known in the art for identity management, and additional techniques to support 

1 7 pseudonymous identities within a particular network protocol, e.g. the stripping of sender 

1 8 URL addresses from HTTP packets on the Internet. 



19 In overview, the top-level claim in the system of Secure Data Interchange, of a 

20 privacy-protected market-based system for data exchange between self-interested parties, is 

2 1 constructed from the following core components: 

22 o A Central Data Warehouse, with associated price-rules and constraints submitted by 

23 the owners of data. Architectural variations allow virtual links to data in the central 

24 data warehouse, with data physically located in distributed locations. 

25 o A Query-execution and Price module, which executes queries on the data within 

26 constraints on the types of queries that can be performed, and computes the price of 

27 queries, collecting payment. 

28 and the following optional components: 



Page 18 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 o Distributed data management, via client-side SDI "proxy" agents, that intermediate 

2 the interactions between agents, and manage data provided to the central SDI data 

3 warehouse. Distributed data management includes methods to manage an agent's 

4 identity in interactions with other agents, for example via pseudonymous and 

5 anonymous interactions. 

6 o Distributed query-execution, via client-side SDI query-execution modules, that 

7 allow data that is physically stored in distributed locations to be queried on 

8 distributed nodes, without an explicit release of the data. 

9 o Data perturbation methods to augment data values with randomized noise, to allow 

1 0 queries to tradeoff price and quality, and to allow a user to protect his/her identity by 

1 1 hiding revealing details of submitted data elements. 

1 2 In describing key applications of the system of Secure Data Interchange, we describe 

1 3 in later sections additional features that extend the functionality of the invention: 

14 o Community dollars: an extended payment scheme that allows extended forms of 

1 5 payment within the system, for example allowing payment to be made in terms of 

16 discounts in purchases from particular vendors, or in return for agreements to 

1 7 purchase a certain number of products over an extended period of time. 

18 o Within an Internet-browser based system, methods to control a user's browsing 

19 experience via a personalized portal, where personalization is performed via data 

20 mining techniques executed by the system of SDI on data in the central data 

21 warehouse. 

22 o Methods to support anonymous and pseudonymous electronic commerce, e.g. 

23 delivery and payment services. 

24 In the next section we provide an overview of each of the central SDI data 

25 warehouse, which forms the core component of Secure Data Interchange. We then add 

26 technical details for specific components, to make our ideas more concrete. 

27 1 .3 Implementation Details 

28 In this we describe some of the core technologies known in the art that would be 

29 used to build up an implementation of the SDI system. The technologies span areas in 

30 cryptography, for pseudonym management, digital certificates, payment mechanisms, 
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1 etc.; and information theoretic-methods, for example to protect the identity of the 

2 originator of a message by routing messages through local "crowds" of agents. 

3 1.3.1 Certificate Management 

4 Cryptographic techniques well-know in the art [Chaum 81; Chaum 85; Chaum 91] provide 

5 the ability for a certificate to be linked to an agent's identity, and not transferred to other 

6 agents. This is the basic functionality required of a certificate management system. For 

7 example, a certifying agency can sign the public key of an agent with the private key of the 

8 certifying agency, to indicate that the agent satisfies requirements for certification. Another 

9 agent can verify the certificate with the public key of the agent and the public key of the 

1 0 certifying agency. The certificate cannot be transferred to another agent unless that agent 

1 1 assumes the same public key. We assume a public key infrastructure to manage this process. 

12 1 .3.2 Support for Anonymous and Pseudonymous Identities We have already noted that it 

1 3 is important to provide whatever additional support is required because of the underlying 

14 communication infrastructure to protect agents' profile management policies. For example, 

15 in the TCP/IP mechanism a message must be stripped of the network address of the 

16 originating Internet server, because this can provide information to allow pseudonyms to be 

17 linked. Similarly, messages can be routed through a common gateway or random 

18 "forwarders" as in the "CROWDS" system [GGMM98; RR98] to provide pseudonymity. 

19 Furthermore, other e-commerce functions, such as payment and the anonymous mailing of 

20 goods must be supported (see [INSERT A FORWARD REF TO A LATE SECTION OF 

2 1 BOTTOM-LEVEL SDI DOC]). 

22 The ability to embed data within web pages allows client-side processing of 

23 information. By embedding profile and location information directly within a web document 

24 we can alleviate the bandwidth and computational bottlenecks that can occur at a centralized 

25 profile server if profiles are fetched on-the-fly when web pages are downloaded by clients. 

26 The origin server (supported by the vendor) requests periodic profile updates from the 

27 central SDI server. This duplication of information enables the profile and the page contents 

28 to be provided directly from a vendor's server. 
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1 There are some potential drawbacks of this approach: (1) the profile information 

2 associated with a web page and target objects can be out-of-date; (2) the profile information 

3 is available to all clients and proxy servers, not just those that are SDI-enabled; (3) the 

4 profile information can be altered. We suggest technical solutions to each of these problems 

5 below. 

6 In one variation of SDI the profile of a user is maintained on the user's client, and 

7 partitioned into separate profiles for each pseudonym that a user chooses to maintain. 

8 Personalization of products and services (product types, prices, etc.) is performed at the 

9 client, through the execution of trusted code that is embedded as a Java applet or as 

10 JavaScript within the web document of a vendor. In this way a vendor never receives access 

1 1 to the profile of a user, but is nevertheless able to personalize its response to users, even 

12 when a user first visits a site (on the basis of the profile for a user from his/her previous 

13 online transactions). Profiles for the target objects of a vendor that enable appropriate 

14 objects (representing particular products, or news stories for example) to be presented to a 

1 5 user are embedded as XML data within the vendor's web document. 

1 6 In another variation of SDI personalization is not performed at the client, but either 

17 at the ISP-level SDI proxy server or the vendor's server. The location and other profile 

1 8 information that relates to a user are pushed to the ISP-level proxy or vendor server when a 

1 9 user requests a web page. In the same way as XML allows profile information about web 

20 sites and vendor products to be associated with a web document, and profile information to 

21 be provided from the central SDI server to a vendor, XML can be used to encode a user's 

22 profile. The system of SDI allows for profile and location information to be randomized 

23 slightly (and even anonymized) to protect the identity of a user, for example when an ISP- 

24 level proxy is not trusted. 

25 1 .3.3 Maintaining the Integrity and Security of Messages The privacy of 

26 information in transit between servers and clients can be assured through standard end-to- 

27 end cryptographic solutions that establish a secure session prior to any data exchange, such 

28 as Secure Sockets Layer (SSL) that uses X.509 certificates and is supported by current 
2 9 browser technology. 
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1 In order to prevent the possibility of individual users being bribed by vendors to 

2 disclose target object profile data which reflects this type of information, users should not be 

3 provided access to directly decrypt the metatags for these portions of the target objects 

4 profile data, but rather this decryption and release of profile data should be performed 

5 securely in conjunction with the functions of the profile processing (profile matching 

6 module) upon the client level proxy server rather this decryption and release of profile. 

7 In addition, we prevent unauthorized access of embedded profile information 

8 through the encryption of the metadata that is represented within the XML structure of a 

9 web page. 

1 0 Profile information can be encrypted using a hierarchy of keys, so that different 

1 1 levels of access to the information may be provided according to the access levels of users 

1 2 and vendors. All users that request web pages from SDI-enabled vendors, whether or not the 

13 user is a member of SDI receive the same profile information. We provide encrypted 

14 profiles to vendors in the 'Profile Update' messages from SDI to vendor servers, so that: (a) 

15 unauthorized agents cannot tamper with the profiles; (b) the profiles cannot be read by 

1 6 unauthorized agents. 

1 7 The SDI system supplies a private key to trusted SDI client software, that enables 

1 8 only SDI-enabled clients to access profile information, and only access that information to 

19 the extent permitted by privacy policies of users and vendors. Different levels of encryption 

20 enforce multiple levels of access. Periodically the key pairs are changed to prevent 

21 extended attempts at cryptographic attacks. The SDI system uploads the key that provides 

22 the correct level of access for a user to a user's client, once terms of access and profile 

23 management have been agreed. A client can only access embedded information once 

24 enabled with a relevant key. Finally, profile information is signed with a digital certificate, 

25 to prevent third parties from tampering with profiles for commercial gain. 

26 2. Core ModulesFigure 2 illustrates the core modules within the system of Secure Data 

27 Interchange. In this section we briefly describe each module in relation to the other modules, 

28 and provide more details in later sections. 
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1 The central SDI data warehouse is the core system in SDI, managing data records on 

2 behalf of agents. In combination with the query-execution module, these two modules 

3 implement the privacy-protected market for query-execution, where queries are executed: 

4 o If the querying agent has the right certificates, as defined by the owner of data 

5 o (and) If the querying agent pays the cost of performing a query 

6 Furthermore, the system of SDI implements an internal market, with queries 

7 executed as cheaply as possible, within quality constraints provided by a querying agent. 

8 The eBank module in SDI collects payments, and allows those payments to be transferred to 

9 external bank accounts, or used in part payment for products and services to SDI-enabled 

1 0 vendors. 

1 1 We allow the central SDI data warehouse to be replicated and/or partitioned across a 

12 network, for example a small single-user SDI data warehouse can be situated at a user's 

13 client machine to allow vendors to provide personalized products and services without 

14 releasing personal information (e.g. profiles). The rules that personalize products are 

1 5 queries, which can be executed in the SDI proxy on a user's machine. 

16 The central data warehouse acts as a repository for data submitted by agent 

1 7 computers on behalf of users. Consider, for example, data that represents user profiles (i.e. 

18 what types of books they like to read, what there political interests are, etc.), or data that 

19 represents business rules (i.e. what type of customer a business is seeking, what types of 

20 products it offers, what type of strategic agreements it is looking to make with other 

21 businesses, etc.) 

22 The data repository contains information submitted by agents to the central data 



23 warehouse. It is not necessary that a single agent links all information submitted by one 

24 agent, because agents can submit information anonymously or pseudonymously. Although 

25 agents submit data to the central shared warehouse they maintain key aspects of ownership 

26 of that data, but without minute-by-minute management of the data. In particular, providers 

27 of information provide the central SDI warehouse with a certain amount of autonomy to 

28 control how the information is used: the types of uses that are permitted, the type that are not 

29 prohibited, and what types of rewards (financial or otherwise) are required for access to the 
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1 information. The system of SDI, in the access that it allows to information in the database, 

2 acts as a trusted agent for information. One role of SDI is to protect the identity of an agent 

3 that requests and provides information — allowing agents to remain anonymous when that is 

4 desirable. We outline in this description the key technical solutions with which this is 

5 achieved. 

6 The data is heterogeneous, best represented within an object-oriented database 

7 system. Each data object is associated with an accessor function, which describes the syntax 

8 of valid queries on a particular type of data, and executes queries. As data types become 

9 standardized, we could suppose a library of standard accessor functions for different data 

10 types, perhaps provided by third parties. For example, a special data type could be 

11 developed to represent the profiles of online customers. As part of query execution 

12 therefore, is the execution of methods associated with heterogeneous types of data. Of 

13 course, we can also describe a simple special case where the data types are more transparent 

14 to SDI, and data fields are directly indexed as in standard databases. 

15 Agents can submit queries to the SDI data warehouse, these are placed in the 

1 6 pending queries queue if they are one-time and to be executed as quickly as possible, or in 

17 the persistent queries queue if they are to be executed periodically, as the data in the 

18 repository changes. The query execution module contains the processor that performs 

1 9 instructions on behalf of agents, within the constraints of price rules. 

20 Price rules are associated with data records. Price rules compute a charge for 

21 requested information about the data record. We describe a number of simple price 

22 languages, to allow an owner of information to configure access to the data. In general the 

23 price rules can be quite complex, and implemented within an object-oriented framework, so 

24 that the system of SDI is expandable to new data types. The query-execution module can 

25 execute both the methods to access data in objects, and the methods to price access to data. 

26 In performing query execution, we suggest some techniques to minimize the price paid to 

27 perform a query — for example we implement a simple internal marketplace where the data 

28 records with the smallest price are selected, with all other things equal. 
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1 As a special case, prices can implement the standard capability-based access control 

2 systems, such as those in UNIX, where a user must possess the necessary properties to read 

3 information. A price $0 corresponds to permission to access information, while a price 

4 $infinity prevents a user from reading information. The price rule computes a price for 

5 accessing information based on a number of factors: the properties of the requesting agent, 

6 as demonstrated with certificates; the information requested; and whether the identity of the 

7 owner of the information is revealed to the requesting agent. SDI enforces the price rules, 

8 with payment collected from agents before the results of queries are reported. SDI allows an 

9 agent to contribute information to the shared database but maintain control over access to 

10 the data. Price rules can also specify different prices based on the degree of random 

1 1 perturbation that is performed on data before a query is executed. This can protect the 

12 identity of the owner of information but still allow valuable information to be provided in 

1 3 response to a query. 

14 The system of SDI also expands the possible space of agreements through 

15 anonymity techniques and through random perturbation of data. Agents can provide 

16 information anonymously or pseudonymously, or with their true identities revealed. 

17 Furthermore, agents can charge less for information provided anonymously than for 

1 8 information provided under a revealed identity. Meta-information associated with data can 

19 also specify whether the information has been certified by a third-party, or whether it has 

20 been randomized slightly before placing in the SDI data warehouse. 

21 The data repository is linked to the query-execution module, that performs queries 

22 that are queued to be executed in the pending queries queue. We allow queries to be general 

23 programs, an extension from the simple queries that are found in SQL-based database 

24 languages. The key novelty is that the query-execution module can maintain intermediate 

25 results, for example pointers to records selected as part of intermediate queries. In a standard 

26 database there is more of a separation between data access and data processing, with data 

27 accessed, pulled outside of the database, and then processed. We allow data to be accessed 

28 and processed in the database, with results pushed to clients. This is has useful privacy 

29 properties, because less information is finally provided in response to a query. It is very 



Page 25 of 3 70 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 useful in the space of secure data interchange because it expands the set of agreements that 

2 can be reached between providers of information and requestors of information. 

3 Data manipulation is via record pointers, which are one-time and anonymous 

4 pointers to data records. We associate a single query (possibly a complex query, i.e. a query 

5 program) with a query session. A record pointer is a temporary identifier for a record that is 

6 selected. The pointer allows persistent queries to be performed on the same record during a 

7 single query session, but is not valid in other sessions. The same data record might be 

8 accessed multiple times during a single session, and with multiple record pointers when the 

9 accesses are independent (i.e. when the querying agent does not know that it is requesting 

10 information of the same data record). Provisional payments are tallied against each record 

1 1 pointer, and relate to the degree of information provided about that record. For example, 

12 consider the problem of finding a set of profiles of a particular type. As the search is 

13 performed over data records in the database the query-execution module assesses a 

14 provisional payment, as records are accessed. However, the requestor of the information 

1 5 only pays the sum payment charged by the records that are actually selected as suitable. The 

16 owners of the data records that were not selected in the final result returned to the user 

17 receive no payment, because no information about their data was released and the 

1 8 information was only used in intermediate processing steps. This is important, because it 

1 9 makes it unnecessary to formulate queries carefully in order to avoid extra cost because of 

20 redundant intermediate steps. 

21 Complex queries, including query-programs, with intermediate results and 

22 anonymous record pointers expand the space of data manipulation in a system with 

23 providers of information that are more willing to release information if that information is 

24 never made directly available to another agent except in aggregated system-wide terms. An 

25 agent might be willing to allow submitted data to be used for data mining applications so 

26 long as the data remains in the trusted data repository. The central SDI server can provide 

27 standard types of query-programs, stored in the SDI-methods module, for example methods 

28 to perform collaborative filtering. 
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1 The query-execution module also performs persistent queries that are submitted by 

2 agents to be executed when particular conditions in the data repository are true. Persistent 

3 queries check for certain information to reside in the data warehouse, and notify a requesting 

4 agent whenever these conditions exist, perhaps automatically making an action. 

5 The pricing module is responsible for pricing queries before execution. The basic 

6 problem is to aggregate the charge over all data records that provide information to form 

7 part of the result of the query, and to estimate the price of a complex query without 

8 performing the query. We suggest a simple top-level query language to allow a user to price 

9 quality-cost tradeoffs in the query that is finally executed. For example, statistical techniques 

10 can be used to compute aggregate statistics without accessing all data records in the 

1 1 database. To give another example, a query can be performed on the basis of accurate 

12 information or on the basis of slightly randomized information. The pricing module reports 

1 3 a price to a requesting agent, and then allocates payment to appropriate agents in the e-Bank 

14 if and when a query is actually executed for a user. 

1 5 The e-Bank is a module that maintains a balance of payments received for each data 

16 entry in the warehouse, one balance for each of the unique SDI identifiers with which an 

1 7 agent submits information. 

18 There can be multiple accounts for a user in the case that the same person or 

1 9 company submits information under multiple IDs (see below). 

20 The communication module receives messages from agents. Agent messages can be 

2 1 updates about information or access rules in the data warehouse, information received from 



22 the owner of data. Messages might also be new queries. The module also sends messages to 

23 agents, for example messages that indicate the results of an instruction are available, or to 

24 send messages that are generated by instructions from other agents and consistent with the 

25 rules associated with an agent's profile. The module also chooses an action to perform based 

26 on messages received, i.e. processes incoming messages and either discards them, places 

27 them in the appropriate location if they are a query, or updates the data and rules when a 

28 message contains new data. 
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1 The following sections describe each of the key components in turn. 

2 3. Data Repository 

3 The data repository in the SDI data warehouse is a general-purpose database that 

4 stores information submitted by agents. Multiple agents submit information, each of which 

5 retains control over access to their data. Access to information is controlled via the price 



6 rules that an agent specifies when providing information. The SDI query execution module 

7 enforces price rules, collecting revenue from requesting agents before providing results. The 

8 system of SDI allows users to receive economic benefits from information, and realize 

9 synergies — in a controlled environment. SDI implements an internal market for 

1 0 information, such that data records with the best price are used in queries, given a number of 

1 1 records that satisfy other requirements. 

12 For example: SDI can be used to store the profiles on online customers, to allow 

13 vendors to better personalize services and identify market opportunities and to allow 

14 customers to receive value for their profiles; SDI can be used to store job offers and job 

15 applications, where job offers can contain salaries that are only revealed to applicants with 

16 suitable qualifications. 

17 SDI is designed to allow users to submit many different types of information, from 

18 many different sources, and for many different purposes. The application is particularly 

19 useful when it becomes a de facto data repository for lots of different types of information, 

20 information that can in fact be analyzed for the purpose of extracting patterns and other 

21 useful data (e.g. with collaborative filtering techniques). However, we provide three key 

22 variations in the following description, which are not necessarily exclusive. 



23 1. A single unified database, with information residing on a central server (or a 

24 number of central servers). We describe two possible implementations for such a 

25 heterogeneous database, one XML-based and another object-oriented model. 

26 2. Multiple databases, partitioned into "data types". For example, one SDI database 

27 might contain profiles on online customers, while another database might contain 

28 information about job offers and job applications. This type of database can be 
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1 implemented using a standard "indexed-field" representation. Again, the 

2 information resides on a central server (or a number of central servers). 

3 3. Distributed data, with some data residing on a central server, and some data 

4 physically located on distributed servers (e.g. on the servers of vendors that 

5 subscribe to SDI), but with virtual "hyper links" from the central server to give 

6 one unified view of the data. 

7 The key over-riding problem is one of data representation, the way in which 

8 information is encoded. We describe a general infrastructure for information exchange, and 

9 do not mean to limit the description to any one type of data. Furthermore, the system of SDI 

10 should be expandable, so that new data types can be introduced. In this specification the 

1 1 type of information in the Secure Data Interchange shared data warehouse is not constrained 

12 in any way, but may include for example user profiles (e.g. preferences, recent purchases, 

13 etc.) or business services (e.g. costs for services, service capabilities, etc.). 

14 One approach is to use a single native SDI ontology, which is expanded as 



1 5 necessary. It would be the responsibility of providers of information in alternative forms to 

16 provide 'translation services" to convert local data formats into the SDI native data format. 

1 7 A typical technology to support a shared ontological representation is XML (Extended Meta 

1 8 Language), which allows a grammar to be defined for a document, with meaning embedded 

19 in tags. The trend towards XML-based applications should facilitate a shared ontology 

20 structure, and allow metainformation to be associated with information and describe data. 



2 1 XML allows intelligent integration of data from multiple databases. 

22 Alternatively, we can allow data to be stored in heterogeneous formats across a 

23 single unified database, within an object-oriented infrastructure. Each data object has a 

24 "wrapper" that controls access, and provides an interface for queries. When a query is 

25 executed, the method is invoked, and the result computed with the method and the data. 

26 This is perhaps more efficient than the aforementioned approach, in that efficient data 

27 formats are query structures are retained. 
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1 3.1 Data Structure 

2 In this section we describe the structure of the records in the SDI database. The next 

3 section describes the interface that allows user agents to submit data and update information. 

4 The data repository has the following key features: price rules are associated with 

5 information, to control access to information on the basis of information requested and 

6 properties about the requesting agent; meta-information to specify additional information 

7 about data records, for example has the information been certified by a third party. 

8 We describe the abstract structure of data records in the data repository. The 

9 semantics of the data field and the price rules depend on the type of information. All data 

1 0 records, whatever the information that they represent, contain the following elements: 

11 1. Owner ID. 

12 The owner ID is a three-tuple (Public Key, SDI identity code, Remote address). The 

13 public key is provided by an agent that submits data, and is one half of a 

14 public/private key pair in a public key based cryptographic infrastructure. The public 

15 key is used to provide authentication of the agent, in case it wants to amend the 

16 record in the future. The agent can submit a message signed with its public key to 

17 prove its identity. An agent can use a different public key for each alias that it 

1 8 maintains within SDL The SDI identity code is a unique code, generated by SDI for 

19 each agent alias, and provided to an agent to allow the agent to access the eBank and 

20 other ancillary SDI services, for example to collect payments received for access to 

21 the data. The remote address is an (optional) contact address for the agent that 

22 submits the information, for example an email address to a pseudonymizing module 

23 that will forward email to the agent under a number of different aliases. 

24 2. Record ID. 

25 The record ID is generated by SDI, and is used in the case that a user creates a 

26 number of different data records with the same owner ID. 

27 3. Data object. 

28 As noted above, we allow data in SDI to be of different types, and a general data 

29 object might be represented in an internal coding that is not known to SDI. Such an 
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1 object must provide an accessor function to allow queries to be performed. The 

2 query-execution module takes an object and invokes the accessor methods to perform 

3 a query. In the most general form, a data object is represented as a three-tuple: (data 

4 type, data field, data accessor). The data type specifies what the type of data is, for 

5 example is it a user profile or a business rule. The data field specifies the data, and 

6 can be a private record that is only accessed by the accessor, which provides an 

7 interface to allow queries of the correct format for the data type to be performed. 

8 In a simple special case, with data types designated by SDI centrally and data 

9 indexed with fixed fields, a data object is more appropriately represented as an 

10 association list between field names and values, e.g. (Fieldl, Valuel), (Field2, 

1 1 Value2), etc.. 

12 4. Price rules. 

1 3 The price rules provide the owner of information with control over the type of queries 

14 that can be performed on a particular data record. The price rules compute an ask 

15 price for a proposed query, on the basis of three pieces of information: what type of 

16 information, and what accuracy of information is requested; what certificates can the 

17 querying agent present; can the information be provided anonymously or must the 

18 identity of the owner of the information be revealed? As special cases, a price $0 

19 corresponds to "access is possible for free", and a price $ infinity corresponds to "no 

20 access is possible". The unit of currency need not be US dollars, but can be any unit 

2 1 of currency, or as we also suggest in one useful variation "Community Dollars" that 

22 are SDI-specific dollars which can only be spent with vendors that are registered with 

23 SDI, and can also be restricted in various ways. As a special case, the prices allow a 

24 simple capability-based security system, because they can be set to $0 or $infinity on 

25 the basis of certificates owned by an agent that requests information. The price-rules 

26 are described in more detail in the next section. 

27 5. Meta-information 

28 The meta-information associated with a data record is expandable, but at present we 

29 suggest the following pieces of information: is the data is randomized, and is the 

30 information certified? The method of random data perturbation is fully described later 

3 1 in this document. Certification can be provided by third-parties, who can verify for 
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1 example the age or nationality of an individual represented with an electronic profile. 

2 We provide more information on metainformation below. 

3 3.2 Price Rules 

4 The price rules that can be associated with a data element are described in the next main 

5 section, the Query Execution Module section. The user associates a rule with data that the 

6 system uses to compute the price of a query during query execution. The price is further 

7 used within SDI to operate a data marketplace. Again, this is described in the next section. 

8 The user will eventually pay the total price for its access to all data elements used to 

9 compute the final response to a query; for example negative responses do not incur a price, 

10 it is only data that actively makes it into the process of computing the result of a query that 

1 1 matters. 

12 We describe in the Query-execution module section the methods that are used to 

1 3 determine which data elements are used to compute the final response made to an agent, and 

14 therefore to compute the sum price for the final query. An agent is not charged for every 

1 5 access to data elements made during intermediate stages of executing a query. For example, 

16 if a query requests information about ten data records with high value associated with a 

17 particular field, the price of the query is the cumulative price for the access to the ten data 

18 records returned in response to the query, and not for all the data records queried in 

1 9 determining the ten records to return. 

20 In general, a query of a data record can be part of a larger query session, as discussed 

21 in the section on query execution. Within a larger query session, the same data record might 

22 be queried a number of times, for example with a large compound query split into a number 

23 of steps with continual execution contingent on continued correct responses. E.g. Select 

24 records of type A, then from those records select records of type B, then from those records 

25 select records of type C... In this case, because price rules might be non-linear in the 

26 amount of information provided, it is important to provide a price-rule method with 

27 information to allow it to track a sequential query. 

28 As discussed in more detail in the next section, the system of SDI handles this by 

29 generating temporary and anonymous pointers to data records, that are valid only for a 
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1 single session, and allow a price-rule to track sequential queries. The data record pointer 

2 allows the history of queries to be recovered, and a new price to be computed on the basis of 

3 total information provided. Similarly, this is important when a query might collect a lot of 

4 information about a data record and then request the identity of the agent that provided the 

5 information to SDL The cost of revealing an agent's identity (and therefore allowing a 

6 secondary market in its information) might well increase with the amount of information 

7 that has already been released. 

8 Similarly, in some cases the same data record might be accessed on multiple 



9 occasions, but independently, such that the environment performing queries does not know 

1 0 that the data record is the same record from before. E.g., select records of type A and extract 

1 1 information with rule B, then select records of type C and extract information with rule D. A 

12 single record can be of type A and of type C, and therefore be selected for information 

1 3 extraction in both cases. In this case, each independent sequential query (i.e. A, B and C, D) 

14 has a set of data record pointers, so that the cost of extracting information is computed 

1 5 independently for any record that has type A and type C. 



16 Here is a simple example of a non-linear pricing function, that accounts for 

17 cumulative information that has been provided to a requesting agent. Suppose that any 

1 8 sequential query receives a temporary and anonymous pointer to a data record, that is only 

19 useful in the current query session. The data record pointer allows the price rule to 

20 implement a non-linear pricing rule. For example, consider the query 'what is the value of 

21 field A, B and C\ split into queries 'what is the value of field A', then 'what is the value of 

22 field B\ then 'what is the value of field C? The pricing rule might state: 

23 $0. 1 for any one of A, B or C 

24 $0.2 for any pair of A, B and C 

25 $10 for all of A, B and C. 

26 In this case, by tracking the data record pointer with which a data record is 

27 accessed, the price of the first two requests can incur an incremental charge of $0.1, 

28 while the price of a third request can incur an incremental charge of $9.80. 
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1 At a per data-record level the price of a query depends on the response made to 

2 the query. This is important because an affirmative answer to the question 6 are you the 

3 president of the united states' carries more information than a negative answer. This has a 

4 slightly undesirable side-effect, in that when estimating the price of a query in the pricing 

5 module, the estimated price of a query might leak information about the result, even 

6 without performing the query. However, we believe that the benefits of linking price to 

7 information content in a query outweigh this potential loss in value of information. Note 

8 in particular, that in general the ask prices associated with data records are private 

9 information and independently set, and therefore are not very revealing, especially within 

1 0 a competitive market place. 

1 1 3.3 Data Submission/Update Methods 

12 In this section we describe the basic methods to register with SDI, submit data, 

13 update data records, and access payments collected by SDI as information is queried. We 

14 use "agent" to refer to the computer system that interacts with the central SDI data 

15 warehouse, submitting data and requesting payment from the eBank. Agents may represent 

1 6 individuals, vendors, or other self-interested parties. 

1 7 Data records can be submitted under multiple aliases by a single agent, to provide an 

1 8 additional level of control and flexibility in managing data submitted to SDI. For example, if 

19 an agent represents an individual that is an online consumer, interacting with different types 

20 of vendors, then perhaps the agent will use two aliases: one for while the consumer is at 

21 work, and one for while the consumer is at home. Alternatively, the agent can maintain a 

22 number of identities for different activities, or interests of the user. In the preferred 

23 implementation agents, for example web-browser based client agents, can manage an 

24 agent's selection of identities as it submits data to the central SDI warehouse local to the 

25 user. The ability to submit information under multiple identities protects the ability of an 

26 individual to prevent another agent building a complete picture about its preferences and 

27 profile, while still allowing that individual to leverage as much of the value associated with 

28 its information as possible. The SDI proxy agent, situated on user's client machines, 

29 implements this functionality. It is described later in this patent. 
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1 A standard cryptographic public key /private key infrastructure provides a useful 

2 technique to implement a system in which agents can maintain different aliases. As 

3 proposed in the work of D. Chaum [Chaum 81; Chaum 85; Chaum91] a public key (PK) 

4 and private key (SK) serves a number of purposes. First, the public key acts as an identifier 

5 for the alias, a name. Second, the agent can compute a new private key/public key pair when 



6 it requires a new alias. The agent keeps the private key secure, and this provides a method to 

7 allow the agent to validate its identity, for example by cryptographically signing a message 

8 with the private key. The signature can be verified with the public key, and the keys can be 

9 selected with enough bits to make falsification a computational impossibility. This 
1 0 infrastructure is outside of the current patent, but standard in the art. 



1 1 As described in the JANUS/LPWA system [BGGMM 97; BGGMM 98; GGMM 

12 98], it is also possible to associate a public/private key pair with a pseudonymous e-mail 

1 3 address, to allow information to be pushed to an agent that owns information under its alias. 

1 4 The system is implemented via pseudonymous proxies which a user agent to poll and check 

1 5 for new messages. 

16 An alias can also be completely anonymous, but in this case the value of the data 

1 7 provided may be less in the internal market place implemented within SDI (in the query 

18 execution module). The convention for an anonymous alias is that the user agent continues 



19 to provide a public key, and use a private key for validation of its identity. However, in this 

20 case the public/private key is one-off and just for this data record, and no return address is 

21 provided. An agent that submits information anonymously can still recover payments from 

22 the eBank. 



23 When a data record is first created SDI returns a data ID, so that the agent that 

24 submits information can specify a particular data records in future, in case it creates a 

25 number of data entries in the central SDI data warehouse. Submitting a new data record to 

26 the SDI data warehouse is accomplished with the following semantics: 

27 (SDI identity code, SDI record ID) = SUBMIT( alias, data object, price rule, 

28 metainformation). 
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1 The following protocol is followed in the SDI data warehouse in response to a 

2 SUBMIT message: 

3 1 . SDI first checks the alias against its record of existing aliases. If the alias exists, then SDI 

4 first verifies that the alias is not anonymous (in which case it should only have one record), 

5 and then challenges the agent to sign a random message with its private key — to validate its 

6 identity. Once validated, the SDI identity code is returned to the agent. Otherwise, if the 

7 alias is not found, then a new SDI identity code is created. 



8 2. SDI then creates a new data record, with the data object, price rule, and 

9 metainformation provided by the agent, and then computes a new record ID, which is 
10 also returned to the agent as proof that the record has been created. This record ID is used 



1 1 to change the data in the future. 

12 The owner of a data record can change the record with the following rules: 

13 Ok = CHANGE(alias, record ID, data change) 

14 Ok = CHANGE(alias, record ID, price change) 

15 Ok = CH ANGE(alias, record ID, metainformation change) 

1 6 As with the SUBMIT command, first the SDI data warehouse checks that the alias 



17 exists, and challenges the agent that submits the CHANGE request to sign a random 

1 8 message with its private key, to validate its identity. Then the record ID is located, and 

19 again it is verified that the record is owned by the agent with the alias. Finally, changes 

20 are made to either the data record, the price rules, or the metainformation. If everything 

21 checks out, then SDI returns TRUE, otherwise SDI returns FALSE. 



22 Finally, an owner of data can remove a data record with the following command: 

23 Ok - REMOVE(alias, record ID) 

24 The checks on the identity of the agent that requests that a data record be deleted 

25 are made as for the CHANGE command. 

26 3.4 Meta-information 

27 An agent can associate meta-information with a data record, that can serve a number 

28 of different purposes. Data records can be tagged with meta-information, that can include 

29 but is not limited to, Has this information been randomized? Certificates relating to the data 
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1 record. For example, the meta-information might be a certificate from a third party about the 

2 integrity of information, or a certificate from an agent's client-side SDI proxy agent that the 

3 information in the record is unique, and not submitted under any other aliases by that agent. 

4 A key example of the role of a certificate is described later in the patent, where we 

5 explain how the client-side SDI proxy can provide a certificate to state that this is the only 

6 data record with information X. This is useful, because it allows an agent that submits 

7 information to maintain multiple records, but still provide a guarantee to querying agents 

8 that certain valuable information is not duplicated across multiple records. 

9 The meta-information may be associated with particular fields in the data 

10 associated with a data element, for example specifying that a particular piece of 

1 1 information has been randomized, or that a particular piece of information is highly 

1 2 sensitive and should be randomized before release. 

13 4. Query Execution ModuleThe query execution module is a key component of the 

14 SDI system. Its key functionality is: 

15 a. Implement an internal market for information as queries are executed, ensuring 

16 that information that is provided for a lower ask price is used in preference to 

1 7 information at a higher ask price. 

18 b. Compute the price of queries based on information finally provided in response to 

19 a query, by keeping track of the data records that are used to compute a result. 

20 c. Interface with the pricing module to allow the price of a query to be computed 

2 1 without executing the complete query. 

22 d. Interface with the data repository, performing queries either on fields with fixed 

23 index labels (i.e. within a traditional database framework), or with respect to an 

24 object-oriented framework with queries performed by invoking methods that are 

25 associated with data. 

26 e. Monitor conditions for the persistent queries, and provide a response to a query if 

27 conditions are satisfied. 

28 A central part of the invention of SDI is the method to compute the price of a query as it is 

29 executed. We described the semantics of price rules, which price access to data. They are 

30 defined by the owner of information. The SDI query execution module implements an 



Page 37 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 internal market for queries, and ensures that queries are executed at minimal cost to agents. 

2 The SDI query execution module is also responsible for collecting revenue on behalf of 

3 owners of information. 

4 We describe this methodology in this section, describing how the total price of a 

5 query is computed, as a sum of the price charged by the owners of data records that provide 

6 information which contributes to the final response. 

7 The query execution module follows a protocol to execute queries: 

8 1 . Request_For_Price(Query, AgentCertificates) 

9 2. Estimate Price with call to the Pricing Module 

10 3. Price_Quote(Quote_ID) 

1 1 4. Request_Query(Quote_ID) 

12 5. Get_Payment() 

1 3 6. Execute the Query 

14 7 . Report_Ans wer() 

15 In step (1) an agent makes a request for a query to be priced, stating the query, and 

16 providing certificates to allow the query to be priced. In step (2) the query execution module 

17 makes a call to the pricing module, and a price for the query is computed (described in the 

1 8 next section). In step (3) the agent receives a price quote, and can then decide whether or not 

1 9 to execute the query, and also a query ID. In step (4) the query execution module receives a 

20 request to perform the query, and then in step (5) requests payment from the agent. When 

21 payment is received the query is executed (6), and appropriate payment is credited to agents 

22 that provide information, scaled to make the budget balance as necessary (in the case of an 

23 over or under price quote). Finally, the response to the query is provided (7). 

24 4.1 Price Rule Semantics 

25 Each data record has an associated price rule that controls the price of accessing 

26 information. A data record can contain a number of different pieces of information, and data 

27 can be provided to different degrees of accuracy, so the price rule can be quite complex in 

28 general. In particular, consider a data record that represents the profile of an individual. The 

29 individual might be happy to have information released about some fields, for example its 
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1 ZIP code, or its recent book purchases, but less happy to have information released about 

2 different fields, for example its salary or social security number. Clearly, different pieces of 

3 information within a single data record require different prices. 

4 Furthermore, a price rule for the data record as a whole might need to be 

5 superadditive across data elements, such that it becomes very expensive to request too much 

6 information about data associated with the same user. One of the driving concerns behind 

7 the present invention is that at present it is possible for on-line vendors to collect 

8 information about a single individual via "cookies" (identifying codes which are left on a 

9 user's client machine), and form a portfolio of information about various activities and 

1 0 preferences of that individual. 

1 1 While information about an individual might be acceptable in small amounts, in 

12 large amounts the same information can soon become unacceptable. Furthermore, even if 

1 3 users release information anonymously in response to a query, if a lot of information is 

14 released the identity of the user can be compromised. Every additional piece of independent 

15 information that I state about my profile identifies myself a little more clearly, and acts to 

16 distinguish me from the profiles of other individuals. We discuss this further in a later 

1 7 section on random data perturbation, which describes how random noise can be added to 

1 8 data to counteract this effect. 

19 The basic idea is that the system of SDI allows a user to associate a price rule with 

20 every data record, that computes the price that a user must pay to execute a query over that 

21 data. We allow the price to vary, depending on properties about the requesting agent, the 

22 amount of information requested, and the level of identification that is required of the owner 

23 of the information in responding to the query. 

24 The abstract form of a price rule for a data record is a function: 

25 (Information-request x Certificates x Level-of-identification ) -> Price 

26 Given a request for information, a set of certificates that a requesting agent can present, 

27 and the level-of-identification that is required of the agent that provides information (i.e. 
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1 anonymous, or revealed identity), then a price rule computes an ask price. This is the 

2 price that an agent must pay to execute the query on the data record. 

3 However, as we describe in the next section, an agent only actually pays this price 

4 if the information provided is used to compute information in the final response provided 

5 to a query. For example, consider query "Select all data records close to record X". The 

6 query is most simply executed by computing the distance between each record and record 

7 X. Although a negative response from a record that is not close to X provides some 

8 information about that record, the information is not used to compute the information (i.e. 

9 set of records) that is finally provided in response to the query. 

10 As in the representation of heterogeneous price rules in SDI, the invention of SDI 

1 1 allows heterogeneous methods to compute prices for queries. In the most general case, 

12 we allow a price rule to be computed as a price method, which is invoked for a data 

13 record before a query is to be performed. The query-execution and pricing modules 

14 simply invoke the price method, and the data object provided by an agent returns a price 

1 5 for the query. In this most general version, the role of SDI is limited to providing the 

16 price method with the query, the certificates of the requesting agent, and the level-of- 

1 7 identification that is required. 

18 In a simpler variation the system of SDI can provide a number of default price 

1 9 rule languages, which allow a user to specify in simple but quite flexible terms a price 

20 schedule for queries. We describe two such rule languages: an uncertainty-based additive 

21 price rule; and a grouped additive price rule which is a simplified version of the general 

22 uncertainty-based rule. 

23 The uncertainty-based additive price rule allows a user to adjust the price for a 

24 query based on information about the requesting agent, the amount of information 

25 requested, and whether the information is required anonymously or with a revealed 

26 identity. It does not allow a non-linear coupling across the price of information about 

27 multiple attributes, but instead includes a simple upper-bound on the amount of 

28 information that can be requested. Such a non-linear coupling could be added with simple 

29 interaction terms between the prices on elements, for example if "information about more 

30 than half of the elements in this set is released, then add price term L". 
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1 4.1.1 An Uncertainty-Based Additive Price Rule 

2 The uncertainty-based additive price rule allows a querying agent to present a 

3 certificate to prove that it is entitled to query a user's data for free, or prove that it is 

4 entitled to receive data at a price. If a querying agent can present no certificate from 

5 either set, then it is not allowed to query the data. 

6 Then, a user can define two different price rules, one for data revealed 

7 anonymously, and one for data revealed with an identity. A price rule computes a price- 

8 term for each data element that is queried, with the total price for a query computed as the 

9 additive sum of all single element terms. The price for a query on an individual element 

10 is a linear function of the uncertainty with which the information is provided. For 

1 1 example, if a query requests the exact value of an element there is zero uncertainty, if a 

12 query requests a range of values for the element, or a 'value + random perturbation' there 

1 3 is some residual uncertainty, and we allow a user to associate a cheaper price with this 

14 result. The uncertainty is normalized with respect to the population of data elements in 

15 the SDI data warehouse, e.g. an uncertainty of 10% indicates that the reported 

16 information is consistent with 10% of data elements in the database. 

17 The price-rule is parameterized with the following information for each data 

1 8 element (or set of data elements, if a user chooses to associate data elements with a type): 

19 o A set of certificates that allow a query to be performed for free. 

20 o The price of an accurate response 

21 o A function to compute a discount from the accurate price, based on the 

22 uncertainty of the response. 

23 There are many possible discount functions, that include but are not limited to the 

24 following: 

25 © A linear price discount functions, such that 100% uncertainty corresponds to a 

26 discount equal to the accurate price. In this case, the user must only provide the 

27 price for an accurate response. 

28 © A step price discount function, where the level of discount is associated with a set 

29 of discrete uncertainty ranges, e.g. <10% no discount, 10 — 30% receives a 20% 

30 discount, and 30-100% receives a 70% discount. 
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1 o A quadratic price discount function, where the level of discount is associated with 

2 a quadratic polynomial function of the uncertainty, in this case a user needs to 

3 specify a number of intermediate discount points and SDI can compute a best-fit 

4 quadratic function. 

5 Stage 1 : Present a Certificate (Price-discrimination) 

6 First, the querying agent can present a certificate to allow it to query information for free, or 

7 to allow it to query the data for a payment. If the agent can present no such certificate, then 

8 it is not allowed to access the data. 

9 Assume that the requesting agents falls into one of three classes: it presents 

10 certificates to make its access free, it presents certificates to make access possible at a 

1 1 price, it presents certificates to make access impossible (infinite price). Let Free denote 

1 2 the set of certificates that allow a query to be performed for free, and Charge denote the 

13 set of certificates that allow a query to be performed for some charge. The failure to 

14 present certificates in either class indicates that an agent is not permitted to query the 

1 5 information. We do not suppose that the presence of a certificate can indicate a negative 

1 6 property, because there is no method to force a querying agent to provide a certificate. 

1 7 We also assume that agents must present the same certificates to access all data elements. 

18 It is trivial to relax this assumption. 

19 Stage 2: Specify the type of information required. 

20 Second, the querying agent specifies whether it is required that the information be 

21 provided along with the identity of the owner of the information. Assume that the agent's 

22 identity is either revealed (with a real or pseudonymous identity), or anonymous. The 

23 price required for particular types of information will vary with the level of identification. 

24 Stage 3: Select the appropriate price rule; Compute sub-prices. 

25 The "cross-product" of the certificate (i.e. free, price, or none) and the level-of- 

26 identification (i.e. anonymous, revealed) triggers a price rule. We allow the owner of the 

27 information to associate a price rule for each combination of certificate and identification, 

28 and with each data element. Many simplifications, for example "no access with no 

29 certificate" and labeling data elements in a class, and providing a price rule for a class of 

30 data elements are possible. 
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1 Without loss of generality, we can describe the price rule for a particular data 

2 element, and a particular cross-product of certification and identification. The price-rule 

3 contains a base-price, which is the price for accurate information about the data element, 

4 and a discount-factor, which reduces that base price by an amount related to the level of 

5 uncertainty in the response. 

6 A query might permit a response that includes a level of uncertainty, for example: 

7 o Is the value of the data field greater than a threshold value? 

8 © What is the range of values in which the value lies? 

9 o Compute an additive noise perturbation, and respond with a "value plus noise 

10 perturbation". 

1 1 The randomized perturbation technique, which allows the agent to return randomized 

12 information about its true value, is disclosed later in this patent. For simplicity, we can 

13 assume that any non-exact query can be associated with an uncertainty- value, which is 

14 computed based on the proportion of the population of data elements that can have values 

15 within the accuracy of the response. The system of SDI can compute this uncertainty- 

16 value, perhaps approximately based on static statistics computed on a periodic basis over 

1 7 information in the database. 

18 For example, SDI might assume that values are uniformly distributed between the 



19 lower and upper values over the population of data elements, and compute uncertainty as 

20 the fraction of values within the range indicated in an agent's response. Suppose that all 

21 data records have values between 0 and 100, then a query 'is x > 70' has residual 

22 uncertainty 30/100 (normalized) if it receives a positive response or 70/100 if it receives 

23 a negative response, while a query 'what is your value of x' has zero uncertainty for all 

24 responses. 



25 Assume that the uncertainty value is between 0 and 100%. The discount factor, 

26 discount(X), is computed as a function of the uncertainty X, with a larger uncertainty 

27 leading to a larger discount. 

28 Price(Q) = Base-Price(Q) - discount (Uncertainty(Q) ) 

29 where Q defines the query that is to be executed on the data element, for example "what 

30 is the value of the data element", or "does the value of the data element lie within this 

3 1 range of values' 5 . The value Uncertainty(Q) is a measure of the residual uncertainty in 
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1 the agent's value for the field after making a response. A simple discount function is a 

2 linear function, for example 

3 discount(Q) = Q * Base-Price 

4 such that an uncertainty of 0% receives no discount, while an uncertainty of 50% receives 

5 a 50% discount. A simple extension might add a multiplier, value between 0 and 1 , to the 

6 discount term — so that the discounted price is always greater than zero. As discussed 

7 above, other more general functions are possible, e.g. non-linear and step functions. 

8 Stage 4. Compute total price. 

9 Finally, the total price of a composite query, query = Q_l, Q_2, Q_N, is 

10 computed in this additive price-rule as the sum of the component prices, where it is 

11 possible that different certificates are required for each element that is queried, as 

12 described by the price-rules for each individual data element. Note, however, that if any 

13 one of the queried elements can not be priced because the querying agent cannot present 

14 a certificate in the set Free or Charge for that element, the price of the complete query is 

1 5 "infinite", and cannot be performed. An agent that breaks a query into a sequence of sub- 

16 queries could perform all queries that do not relate to that data element, but then the first 

1 7 query that relates to that data element is blocked. 

18 4.1 .2 Grouped Additive Price Rule 

19 The grouped additive price rule allows a number of simplifications to the basic 

20 additive uncertainty-based price rule described above. It is designed to allow a user to 

21 define less information in order to specify a price rule for all of its data in the SDI data 

22 warehouse. The basic idea is to group data, so that data elements of the same type can 

23 share the same price rules. We propose data sensitivity types, where each data element of 

24 the same type has the same price function. 

25 Furthermore, this can be used on combination with very simple price functions, 

26 that compute one of four prices: 

27 © Zero price, if an agent presents a Free certificate. 

28 o Low price, if an agent presents a Charge certificate, and requests information with 

29 a High degree of uncertainty. 
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1 o High price, if an agent presents a Charge certificate, and requests information 

2 with a Low degree of uncertainty. 

3 o Infinite" price, if an agent cannot present a Charge certificate. 

4 Different price functions can be defined for anonymous and revealed-identity. 

5 The price rule has the following components: 

6 a) A set of certificates that allow data access for free. 

7 b) A set of certificates that allow data access for some charge. 

8 c) A list of price-functions, each defined with two prices for revealed-identity and 

9 two prices for anonymous identity. The pairs of prices are for accurate 

10 information, and approximate information (with a minimal level of 

1 1 approximation). 

12 d) A mapping of data elements to price-functions. 

13 Again, the price of a full query is computed as the sum price over all information 

14 requests to all data elements. 

15 4.1.3 Superadditive Price Rules 

16 We can allow a non-linear interaction between the price of queries on individual 

1 7 data elements with the introduction of additive interaction prices, which are fired when a 

1 8 threshold is exceeded for the total amount of information released over all data elements 

1 9 accessed in the same query. This is a simple approximation to a more general 

20 combinatorial price rule, which would price all combinations of data access terms 

21 explicitly. For example, a rule could state: "If more than T data elements are accessed in 

22 set Sensitive then add a "bundle price" L to the total sum price of the query."; or: "If the 

23 total weighted access to data elements in set Sensitive exceeds threshold T_l a then add 

24 bundle price L_l to the total price of the query"; where the weighted-access is computed 

25 with weights equal to the degree of uncertainty associated with queries on elements. 

26 4.2 Computing the Price of a Query 

27 A query can be quite complex, involving a number of operations on data stored in 

28 the data repository. Call a query session a sequence of operations performed in response 

29 to a query, and before the answer is returned to the agent that submits the query. 
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1 A general query specifies a program that operates over data stored in the data 

2 warehouse. All queries are of the form: 

3 A = SELECT records with property X 1 ; 

4 PROCESS records in A with method Yl ; 

5 B = SELECT records in A with property X2; 

6 PROCESS records in B with method Y2; 

7 C = SELECT records with property X3 ; 
8 

9 i.e., a general query is a sequence of select and process commands, where process 

10 commands are specified over temporary record pointers that are returned by SELECT 

1 1 commands. A temporary record pointer is valid only in a particular query session. 

12 Properties define Boolean conditions that are computed as the result of requesting 

13 information from a data record. For example, 'is your distance to record X < 0.1\ or 

14 'have you purchased a flight in the past month', etc. Methods define computational steps 

15 that are performed with record pointers. For example, 'if your salary > $40,000 then 

16 compute the similarity between your profile and the profile of record Y\ etc... Both 

1 7 methods and properties may be executed by calls to accessor methods that are associated 

18 with data in the case of a database with heterogeneous data and an object-oriented 

1 9 methodology to access information. 

20 One possible syntax for a language to structure such a query could be loosely 

21 based around C++ or Java, with commands to be invoked as methods of data objects 

22 specified with object oriented notation. The language could be quite general purpose, but 

23 made consistent with the SDI database with a clear separation between the data in the 

24 repository, and the state of the query program. 

25 Access to data is tightly regulated, via data record pointers. Data record pointers 

26 are analogous to variable pointers in C++, and allow a query program to manipulate data 

27 and perform general operations on data records. Data record pointers in SDI provide a 

28 key role, because they ensure that access to information is controlled, according to price 

29 rules that are associated with data. We maintain a single copy of data on a secure server, 

30 and pass dynamic and temporary pointers around while data is processed. A clear 
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1 separation is maintained between the data and the state of the program that executes 

2 queries. 

3 Each query session is associated with a table of data record pointers. The table 

4 records valid data record pointers, that can be referenced in later query operations, and 

5 the current price of queries performed with the data record: 

6 Data Record Pointer | Current price 

7 | 

8 PJ | $0.10 

9 P_2 | $0.20 
10 

1 1 The SDI query execution module performs another key book-keeping service, recording a 

12 list of data record pointers that have been used to compute a result in the program that 

1 3 executes a query. 

14 For example, suppose that set PS contains a list of data record pointers, and a 

1 5 subsequent query operation makes the following command: 

1 6 count = Count(PS, 'property X') 

17 which can be interpreted as "count the number of records in set PS with property X". The 

18 query-execution module maintains a table of variable accounts, which records the data 

1 9 records that were used to compute a result: 

20 Variable Name | Associated data record pointers 

21 I 

22 _count | P_1,P_4, P_6 

23 Jmp | P_3,P_8,P_10 
24 

25 Whenever the value of a variable is computed from the values of a number of 

26 existing variables, then the list of data record pointers is the union over the data record 

27 pointers which were used to compute the values of the other variables. 

28 Eventually, when a result is reported at the end of the execution of a query, the 

29 price of the query is the sum of the requested payment for the data record pointers that 

30 were used to compute the information provided, as stored in the table of data record 

31 pointers. 
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1 Figure 3 illustrates the method diagrammatically. We maintain a "firewall" 

2 between data and the state of the query execution, with all access to the values in data 

3 records via data record pointers, and invoking price rules associated with data records. 

4 On the query-program side we maintain a table of data record pointers, and the cost of the 

5 cumulative information received from that data record under that pointer, and a table of 

6 variables and the data record pointers which were used to compute the values in the 

7 variables. 

8 All manipulation with the values of variables on the "query-side" of the data 

9 firewall can be performed with no additional data-access charge, because data access 

10 charges are only incurred in computing values of variables, when that directly or 

1 1 indirectly requires that additional information be collected about the data records. 

12 The only charges are incurred when the processor receives information. This 

13 information might be in the form of a number, that represents the count of records with a 

14 particular property, aggregate information in the form of a new collaborative filter, or raw 

1 5 data extracted from data records. 

1 6 For example, SELECT operations are free until information in the data record is 

1 7 used to compute the value of something which is used in computing the final response 

18 returned in answering a query. A charge is accounted for against the data records which 

1 9 are queried and selected, but that charge is only levied if information is used. 

20 4.1 .1 Internal Market within the Query Execution Module 

21 The data side of the firewall is also responsible for performing the cheapest 

22 queries possible. For example, if there is a request to retrieve 100 data records with 

23 property X, then the database retrieves the data as cheaply as possible based on the 

24 access-control rules of the owners of the information in the database. 

25 All operations are implemented within an internal market. The internal market ensures 

26 that information is provided according to the price rules associated with data records, and 

27 also places providers of data in competition with each other. 

28 The system of SDI implements an internal market, with queries executed as 

29 cheaply as possible. Both the SELECT and the PROCESS commands allow an associated 
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1 restriction on size, e.g. "SELECT 100 records with property X", or "PROCESS 10 of the 

2 records with data pointers in list XI with method Y". 

3 In one greedy variation, each stage of a sequential query is executed as cheaply as 

4 possible. For example, for each operation the query is implemented by choosing the 

5 cheapest data records that satisfy the criteria. For example, the records that are selected 

6 are the 100 cheapest records to query that satisfy property X, and the 10 records in XI 

7 that are cheapest to process with method Y are selected. 

8 This greedy method is not necessarily optimal for a sequence of queries, for 

9 example when the records that are cheapest for the current query are more expensive for 

10 a later query. Another variation allows a set of records to be maintained, with the 

1 1 cheapest records used in reporting the final result to the user. It might not be optimal to 

12 use the cheapest records at every point. For example, if a query is described as "SELECT 

13 100 records with property X", then "PROCESS with method Y", and "RETURN result", 

14 it can be cheaper to select more records (e.g. 1000) initially, process all the records, and 

1 5 then return the 100 records that are cheapest after the processing. 



16 The method of storing the data record pointers in a table, along with the price 

1 7 required for information, and keeping a record of the data record pointers that are used to 

1 8 compute values during the execution of a query program provides a technique to compute 

1 9 the total price of a query. 

20 In the previous section we described price rules, which are associated with data 



21 records and prescribe on a record-by-record basis the price of executing a query. 

22 However, only data records that are used in computing the final result that is returned to 

23 an agent at the end of a query receive payment. 

24 4.1.2 Query Optimization 



25 Finally, consider the following two examples: 

26 COUNT the data records with PROPERTY X 

27 and COUNT the data records with PROPERTY (not X) 

28 In an optimal pricing model these queries both have the same cost. To focus ideas 



29 further, suppose that no data records have property X, such that without additional query 
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1 optimization the cost of the first query is $0, while the cost of the second query is the sum 

2 cost over all data records of reporting that they do have data property X. 

3 To solve this problem we add a simple query optimization routine in the pricing 

4 module, which can test different formulations of properties (i.e. negatives), and the effect 

5 of reformulation on price. The order of operations for a given expression of properties 

6 does not change the price, but the way in which properties are expressed can affect the 

7 price. 

8 In another extension, we can use statistical methods to optimize performance. For 

9 example, a query 'what % of data records have property X' can be computed using a 

10 sample of the total data records, for an accuracy/price tradeoff. This functionality can be 

1 1 offered within the pricing module. 

12 Another key technique for query optimization is to allow the SDI central server to 



13 perform pre-computation, for example to classify data records into useful bundles, and 

14 allow future queries from agents to limit themselves to bundles of data records that would 

15 appear to have useful properties. The cost is then incurred for further information 

16 revelation from these records, and avoided for information revelation from records which 

1 7 may not have useful properties. 

18 4.2 Example of Query Execution 



19 As an example, consider the following composite query; 

20 1- SELECT all data records that relate to users that have purchased a flight in the past 

21 month, 

22 2- PROCESS: Sort the data records according to salary, 

23 3- SUBSELECT: Limit the records to those users that are willing to have their identities 

24 revealed, 

25 4- RETURN RESULT: Return the identities of the remaining users with the top 100 

26 salaries. 

27 In step 1 data records from the SDI data repository are selected which have the 

28 required property, in this case corresponding to profiles of individuals who have 

29 purchased a flight in the past month. The table of data record pointers is augmented with 

30 temporary pointers (valid for this query session) to records with this property, and the 
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1 price charged by each data record to reveal this information. Notice that the table only 

2 includes pointers to data records that have this property. Although data records that 

3 respond negatively to the query reveal information to the query-execution module, that 

4 information is never revealed to the agent requesting the query, and therefore no payment 

5 is received. 

6 In step 2 the data records that relate to profiles of users that have recently 

7 purchased a flight are now sorted in terms of their salary field. An algorithm to sort 

8 performs a sequence of comparisons between data records, with each comparison 

9 requiring that more information is revealed about the underlying salary of the profile, and 

10 changing the price associated with the data record. The prices are updated in the data 

1 1 pointer table according to the data record price rules, and the query program now has a 

12 list of ordered data record pointers. 

13 Now, in step 3, the query program initializes a new set of data record pointers. 



14 This includes all the pointers for users that are willing to have their identities revealed. 

15 The pointers are maintained in order of salary. At this stage the entry for this new 

16 variable in the table of variable accounts only associates the data records with this new 

17 property. 



18 Finally, in step 4, a new variable is initialized to contain the first 100 data record 

19 pointers from step 3. These are the data record pointers listed in the variable accounts 

20 table. These are the only data record pointers that receive payment as a result of this 

21 query, because the intermediate information provided by other data records remains 



22 within the SDI query execution module and is not released to the agent that performs the 

23 query. 

24 4.3 Discussion 

25 The pricing model as outlined above has the following attractive property: 



26 The price of a query is independent of the order in which operations are 

27 performed, and optimization of order is not necessary for reasons of price. 

28 This is nice, because it means that agents do not have to perform lengthy optimization to 

29 structure queries in cost-efficient ways. The pricing module also ensures that: 

30 Agents only pay for information received as the result of executing a query. 
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1 Finally, we implement an internal market, so that with all other things equal, data mining 

2 and other types of queries are executed as cheaply as possible: 

3 An internal market favors data records with low cost pricing instead of records 

4 with high cost pricing, with all other things equal. 

5 As an example, consider this query: 

6 COUNT (SELECT 'FIELD1 = X' and 'FIELD 2 = Y') 

7 This query can be structured as 'Count (Select(X and Y))\ or 'Count( Select( X, 

8 Select(Y)))', or 'Count(Select( Y, Select(X)))', all with the same cost. Although different 

9 amounts of information is requested from all data records in computing the query in the 

10 three different formulations, the final cost is computed only over the records that have 

1 1 property X and Y, and these records are the same across formulations and have released 

1 2 the same information. 

1 3 Similarly, consider the query: 

14 SELECT(Z, Select(Y, Select(X))); return '2\ 

15 i.e., perform some complex nested query but return the fixed value '2\ The query 

16 execution module tracks that no data pointers are used to compute the value returned to 

1 7 the agent, and the querying agent is not charged for the information provided about data 

18 records to compute the SELECT operations, because that information is retained within 

1 9 the central SDI data server. 

20 In a SELECT query the agent can only ever pay for information provided about 

21 data records that respond positively to a SELECT, because no information is ever 

22 released to agents about the data records that respond negatively. 

23 In another example, consider a query 'COUNT the data records with PROPERTY 

24 X', that returns the result 'none', i.e. there are no data records with this property. This 

25 query is priced at $0 because none of the agents that own information provide 

26 information to compute the 

27 Therefore, a query of the type 'how many data records have type X' that answers 

28 'none' has price $0, because the individual agents have not revealed any information. 

29 Similarly, because we would like all equivalent queries to have the same price, a query 

30 "how many data records do not have type X" in the same database (answer "all of them") 

31 should cost $0 because the same query could have been formulated the other way. A 
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1 reasonable way to ensure that a query is formulated in the cheapest possible way is to use 

2 an query optimization manipulation program, of the type that is common in database 

3 research, during the 'price quote' phase in order to find the formulation with the cheapest 

4 price. 

5 5. Pricing Module. 

6 The pricing module is a core component of the central SDI data warehouse. It has 

7 two functions: (a) to predict the price of a query without executing a complete query; (b) 

8 to aid in the allocation of payment given a price quote. 

9 We build into the pricing module a method to allow an agent to make a 

10 quality/cost tradeoff before executing a query, and even to specify a restricted query that 

11 will respect an agent's budget. Already embedded in the query-execution module is an 

12 internal market for data records, to ensure that cheaper data is used in preference to more 

13 expensive data access, given two identical data records. Furthermore, already embedded 

14 in the query-execution module is that agents only pay for the provision of information. 

15 Finally, the method computes a price for a query which is independent of the order of 

16 operations. 

1 7 We propose the following method for providing initial price quotes to agents in SDI: 

18 1 . Compute a range of price quotes in response to a query, Tor different amounts of 

19 computation (which can be varied by limiting the input set of data records), and for 

20 corresponding values of a "quality metric" provided by an agent that submits a query. 

21 2. Allow the querying agent to select an optimal price/quality/computation tradeoff, 

22 and then receive payment, execute the query, and provide the response. 

23 3. Allocate payments to accounts in the eBank that correspond with data records 

24 which are used to compute the value of the result of the query, with payments 

25 adjusted if necessary to account for any error in the price quote. 

26 5.1 Computing a Price for a Query 

27 We allow a small fixed price F for an initial price quote, to cover the cost of 

28 computation and prevent agents saturating the server. The price module computes a price 

29 quote based on a statistical technique to compute the price of the full query without 

30 executing the full query. 



Page 53 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 The basic idea is to perform the query on a number of records, e.g. 1%, 5%, and 

2 10% of the total records that will be used for the full query, and then extrapolate to the 

3 full query size. We need to perform the complete query because on a subset of data 

4 records, instead of part of the query on all data records, because query programs can have 

5 different phases — and each phase might incur very different information costs. The 

6 number of records that are used for the query can be restricted by making a random 

7 sample of the total domain of data records used for a full query. 

8 5.2 Allowing a Price/Accuracy Tradeoff 

9 In computing the initial price quote we also suggest a tradeoff for the user 

10 between "result quality" and cost. Quality is subjective, and usually best measured by the 

1 1 agent that requests a query on the data. Therefore, we suggest that the querying agent 

12 should provide the metric, but limit this metric to a scalar value to prevent information 

13 leakage. We allow an agent in formulating a query to specify a key metric that it will use 

14 to choose an appropriate size of query to execute. 

15 The core technique that we use to control the accuracy of the result of a 

16 computation is to limit the number of total data records that a query runs over as input to 

1 7 the query. Take a random selection of all the data, and use that for the processing. At this 

1 8 initial step we limit the price and the amount of computation performed. 

19 As an example, consider a query to collect pseudonyms for agents with useful 

20 properties. The metric of interest in this case is the average per-pseudonym cost, and this 

21 can be provided by the pricing module without providing any useful information to the 

22 querying agent Given this information the agent can then decide whether to proceed with 

23 a query, and how much money to allocate. 

24 Similarly, perhaps a querying agent cares about the quality of match between data 

25 records and data records with ideal properties. The agent can provide a method to 

26 instrument its query to compute this quality, and the price module can in pricing the 

27 query provide guidance about the tradeoff between running the query over different 

28 numbers of data records. 
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1 Another more involved method is to introduce randomization as the query is 

2 processed, because some owners of information may provide randomized information 

3 more cheaply than exact information. 

4 That will vary depending on the number of data elements that are queried. 

5 5.3 Making a Price Quote for a Budget 

6 Consider the problem of an agent that wished to perform a query with a budget B. 

7 For example, I would like to spend $1000 to compute a collaborative filtering model. 

8 There is a particularly simple way to allow this: 

9 First, estimate the cost of building a collaborative filtering model on the basis of 

1 0 TV data records, selected at random (or with some preprocessing of the database to 

1 1 identify useful data records). Estimate price P for the query. 
12 

1 3 Second, adjust the number of data records used to compute the actual query based 

14 on the price P that was computed, and the budget B. The number of data records, 

1 5 assuming a linear model of per-data record charging is simply N x B/P. 

1 6 If there is reason to believe that the cost of a query is a non-linear function of the 

17 number of data records, then a number of samples could be priced, for suitable sample 

1 8 sizes N_l, N_2, . . . etc. 

1 9 Ultimately, the result is a query that is adjusted in scope by an upfront limit on the 

20 number of records used for the query domain. The price module can scale the cost of the 

2 1 query, and make an appropriate price quote. 

22 5.4 Optimizing the Price of a Query 

23 We also suggest a technique to adjust the formulation of properties in a query, e.g. 

24 c COUNT(SELECT records with property X)', could be reformulated as 'Count(all 

25 records) - COUNT(SELECT records without property X)', possibly for a smaller price. 



26 A random search technique can provide one simple approach to optimize query formulas. 

27 6. Communication Module 

28 The communication module in the central data warehouse takes incoming messages from 

29 agents and decides how to handle them, and sends responses to queries back to agents. 
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1 The action selected in the communication module depends on the type of the 

2 message, for example the message can be a data-update request, a new query to execute, 

3 a request to add new profile information, a request to find agent profiles that match, or a 

4 request for processing a set of profiles that meet a particular criteria with an application 

5 in the database. 

6 Figure 4 illustrates the communication module in block diagram form. Messages 

7 are received from agents, and passed to the data repository in the case of data-update 

8 commands, or requests to add new information to the data repository. Alternatively, if the 

9 message represents a query then it is either placed in the queue for execution, i.e. the 

10 pending queries queue, or in the set of persistent queries — which are periodically checked 

1 1 by the query execution module and executed when the correct conditions exist in the data 

12 warehouse. The communication module can be implemented using standard message- 

1 3 handling methods known to those skilled in the art. 

14 7. Physical Location of Data The default implementation of SDI assumes that 

15 information is physically stored in the central SDI data warehouse. However, this is not 

16 necessary. The system of SDI is designed to present a privacy-protected marketplace for 

17 data, where queries can be executed with complete transparency as to the physical 

1 8 location of the data over a network. It is not necessary that data is physically uploaded to 

19 the central SDI data warehouse. Instead, we allow a user of SDI to provide a virtual 

20 pointer to the location of the data on another machine, for example hosted by a trusted 

21 third-party, or for example their own data server when that server is located within their 

22 own facility and always online. Alternative data locations provide tradeoffs between 

23 computation efficiency, communication cost, delays, and privacy. 

24 7.1 The Convergence of Network Devices 

25 The invention of SDI is in no way limited to a centralized implementation. The 

26 invention is characterized by its methods and protocols for handling personal information 

27 and query execution. As best as possible the invention applies to any distributed system 

28 implementation that follows the disclosed methods. 

29 Users with different types of personal computing devices can interact in a 

30 seamless fashion with the information and personalization methods provided within SDI, 
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1 with implementations that allow remote data access and mobile data. The user must either 

2 be identified by permanent devices as he/she moves around, or the user has his/her 

3 dedicated mobile device. 

4 Co-pending patent application, entitled Location Enhanced Information 

5 Architecture (LEIA), provides a basic framework for this type of multiple-device data 

6 collection and delivery network environment. It describes a device and location 

7 independent methodology for identifying individuals utilizing an artifact termed a user 

8 identifier (or UID). This artifact encompasses a variety of more specific media for 

9 identifying individuals such as smart cards, credit cards, biometrics, EZ Pass, GPS, and 

10 the like. Additionally our issued parent application to this case entitled: "Pseudonymous 

1 1 Server for System for Customized Identification of Desirable Objects" [U.S. PATENT 

12 5,754,938] suggests a means for cross correlating user data for profiling purposes. The 

13 data may be collected from both on-line and off-line media sources, including the user's 

14 personal computer, purchase data collected by a vendor or credit card company. The 

15 profiling can be used to deliver personalized recommendations to the user through the 

16 web, through email, or even through point of sale terminals at a retailer's physical store 

1 7 location. 

1 8 Other devices which may be used to push personalized information, and to collect 

19 profile information, include the users set-top box, in home appliances (such as stereo 

20 system), an automobile on-board computer, a personal digital assistant (PDA), a cellular 

21 phone, a pager, an ATM, a fuel pump, point of sales-terminal, wearable computers, etc. 

22 In most of these examples, the device is typically a light client (which is descriptive of 

23 these "Network appliances")- One variation of SDI's privacy architecture pushes a proxy 

24 server to the client-level, where a user's personal profile information is stored. This 

25 provides a secure technical framework for protecting user profile data. In the case that 

26 the client memory is limited, because the user has a "light" network client, then the proxy 

27 server functionality can be pushed to the ISP or a central SDI server. If the light client 

28 consists of a remote shared terminal (e.g. ATM kiosk, POS device), personal data may be 

29 transmitted from the user's own client device if/when and at the time it is needed and 

30 authorized by the user. This approach would be certainly feasible within the "always on" 
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1 client scenario, for example with a user's data stored at his/her set-top box, which is 

2 always "on-line". 

3 The devices with which a user interacts can check a user's data use policy, and 

4 then push data accordingly to the SDI network servers. Because servers at the network 

5 level are inherently not as secure as the user's own client, extremely sensitive personal 

6 information may still be stored even at a light client, i.e. data can be split across multiple 

7 locations. In general the user may make a tradeoff between data management (latencies, 

8 bandwidth) costs and with the potential loss of information that can occur when data is 

9 released to other parties, even if that party is the central SDI data server. 

10 Infrastructure for Smart Home / Smart Office Intelligence The management of 

1 1 the potentially vast amount of information generated by such devices, as well as its 

12 privacy protection, is enabled by the "personal agent", an on-line software entity that 

13 represents and acts on the behalf of the individual user. Furthermore, this agent is 

14 capable of communicating with and negotiating terms with agents representing other 

1 5 entities, including systems that form the infrastructure backbone for intelligent homes and 

16 offices. 

1 7 Such systems (as embodied by the LEIA disclosure) have access to a broad range 

18 of input sources, allowing them to monitor different aspects of the user's activities or 

19 location. Inputs can include data derived from: 
20 

21 * Email, wireless PDA, telephone communications. 

22 * Web browsing 

23 * Personal Computers 

24 * Personal Digital Assistants 

25 * Pagers 

26 * Wrist watches (typically one-way, receive only) 

27 * ATM machines, point of sale kiosks 

28 * Ceiling-mounted video cameras 

29 * room-based motion and heat detectors 

30 * Internet-enabled automobile 

3 1 * Chip-enabled appliances 
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1 * Electronic Schedule Books 

2 Smart Home Application The concept of a "smart home" is not a new one: Even in 

3 the 1950's futurists discussed the possibility of imbedding intelligent systems into a 

4 house, relieving the homeowners of such boring daily tasks as watering plants or 

5 adjusting the thermostat. Such a system could easily be implemented today using control 

6 computers running simple sets of IF-THEN rules, but might soon prove unreliable 

7 because of its inability to adapt to new situations. For example, a garden party could 

8 easily be ruined if inadvertently scheduled at the same time as a scheduled lawn- 

9 watering. 

10 LEIA provides a much more sophisticated alternative. Given a house that 

1 1 incorporates an internal local area network (LAN) linking household appliances, sensors, 

12 communication devices, and home computers, LEIA could easily form the basis for a 

13 truly intelligent and adaptive home environment capable of handling daily chores, 

14 security, and information delivery. 

15 Firstly, LEIA could easily subsume the tasks traditionally slated for use in a smart 



16 home; however, its ability to infer the location and needs of family members would 

1 7 greatly enhance the handling of these tasks. For example, the above-mentioned garden 

1 8 party would not be spoiled because a motion sensor would inform the watering system 

19 that a party is taking place on the lawn. Or, being set in a power-saving mode, LEIA 

20 could light only those rooms currently being used; as a family member moved around the 



21 house, lights would be turned off and on, tracking the person's movement so that only 

22 their immediate surroundings are lit. 

23 

24 Secondly, LEIA could use its ability to predict informational needs (in a manner 

25 very similar to that used for the Automotive Information Delivery System) in the house, 

26 customized to the personal tastes of different family members. One could imagine recipes 

27 being delivered audibly in the kitchen, or a television program that follows a family 

28 member from room-to-room as he strolls around the house. 

29 Finally, LEIA would have the sophistication needed to deal with intelligent 

30 appliances, telling the swimming pool to clean itself, for example, an hour before the 

3 1 commuter returns home. 
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1 LEIA provides a unified infrastructure for the increasingly sophisticated systems 

2 being incorporated into modern homes. It would act as the central manager, making sure 

3 that basic household tasks, security, and informational needs all work in a coordinated 

4 and intelligent manner, adapting smoothly to the uncertainties of daily life. 

5 A smart home would have several different types of sensors: 

6 i) "Passive" detectors - 

7 *motion detectors 

8 *sound detectors (including voiceprint and speech analysis module) 

9 *Active Badge detectors 

1 0 *Window and door sensors (are they open/closed?) 
11 

12 ii) sampled communications 

1 3 telephone 

14 * e-mail 

15 *radio/TV 

16 *computer usage 
17 

1 8 iii) appliances 

19 * alarm clock 

20 *light switches 

21 * stove 

22 *dish washer/clothes washer 

23 *water taps 

24 In addition, accessed via secure channels (perhaps using a personal agent as an 

25 intermediary), three major types of personal data are also useful for running the smart 

26 home: 

27 i) Explicit rules for routine tasks (E.g., desired schedule for lawn watering) 

28 ii) Expressed preferences (E.g., a desire for morning news about international 

29 events) 

30 iii) Historical database of previous interactions between the family and the house 

3 1 (This will allow LEIA to learn daily routines and family habits). 
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1 

2 There are two major classes of data that need to be served by LEIA: (1) control 

3 signals for household systems and appliances, and (2) information desired by household 

4 members. 

5 To a large extent, household systems can be appropriately controlled using a 



6 traditional rule-base, although a fuzzy rule-base would be inherently more flexible (IF 

7 <the bedroom is somewhat chilly> AND <it's almost time to wake up> THEN 

8 Substantially increase the bedroom's heat>). Many of these rules would be explicitly 

9 programmed by the family from the beginning; others could be learned using inductive 
1 0 methods. 



1 1 Although most of the household systems could run unsupervised using such a rule 

12 base, LEIA would remain the ultimate arbiter, and could override the rule-based system 

1 3 in exceptional cases. For example, if the family suddenly has to go out-of-town, LEIA 

14 can infer their absence and inform the household systems to go on energy-saving modes. 

15 It should be noted that certain state variables will have a major impact on this 



16 analysis. The date, time, mood (inferred from sounds levels and voice content), and 

1 7 current composition and location of people in the house should be carefully conditioned 

18 on. Strong daily routines will emerge (for example, a typical working day), but 

19 exceptions will also be noted (when the teenaged son is home alone, and in an exuberant 

20 mood, play pop music LOUDLY). 



21 Extension to Data Delivery Given the ensemble of sensors that form part of the 

22 smart home, it would not be difficult to monitor the types of off-line information being 

23 consumed by the inhabitants. Biometrics (systems monitoring body state, facial 

24 expressions, retinal reactions), sound sensors (capable of translating speech to text, and 

25 then text to content, using natural language processing techniques), and video monitors 

26 (which could observe reading material such as magazines, textbooks, tv viewing guides, 

27 cookbooks, etc., and again use NLP methods to scan for content), would all form part of 

28 the input set. 

29 This information is combined with all other sensor readings to infer the general 

30 focus of user interest. Within the scope of the present network architecture, the parent 

31 patent application describes how such contextual data can be converted into target object 
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1 profiles and through the use of the profile processing module "similar' target objects may 

2 be retrieved from the electronic medial distribution system (cached locally or over the 

3 network) and delivered and displayed at the local user terminal device for viewing by the 

4 user. Such media (which may be tailored for delivery at a RV set-top) may include 

5 streaming video/audio, multi-medial files or video-on-demand which are delivered in 

6 near real time upon activation of the request and which are of matching similarity (or 

7 complementarity) to the print media viewed by the user presently. Alternatively, the data 

8 captured by the miniature camera device relating to the particular pages of print media 

9 presently observed by the user may consist exclusively of an identifier code a portion of 

10 which is used by the system for purposes of identification of the code (or alternatively 

1 1 such identification may be determined by its physical coordinates on the page itself). The 

12 other portion of the code is a unique identifier of the particular page, which may be used 

1 3 to identify manually by a content expert whereby select print materials and particularly 

14 their associated electronic counterparts are either matched, designed and/or customized 

15 for use within their system context of the present convergent media application. It is 

16 apparent that such a system could also be usefully deployed to benefit advertisers, e.g., 

17 presenting advertisements electronically to correspond with particular printed content or 

1 8 matching printed ads with corresponding electronic counterparts. 
19 

20 This complimentary content, be it audio, video, text, or combinations thereof, 

21 including advertising conforming to user-defined conditions, is delivered through various 

22 in-home devices located in the proximity of the user. 

23 Smart Office Application Large business centers can be very confusing places: 

24 people are often away from their desks, many different meetings are scheduled and 

25 rescheduled during a single day, people are called off to emergencies, and clients drop in 

26 for unexpected visits. The inability to coordinate people and information in such 

27 situations leads to greatly reduced efficiency. 

28 LEIA offers a unified solution in the face of this chaos; by linking together 

29 people's schedule calendars, communications systems, contact information, and location 

30 coordinates, a system can be devised to (1) shepherd a visitor (even if unexpected) 

31 around the business center, (2) push vital contact information about the visitor to those in 



Page 62 of 3 70 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 the company who will be visited, and (3) adaptively coordinate important meetings 

2 within the company. 

3 When a visitor comes to the center, he will: 

4 a) sign in at a front desk; rather than sign a book, he logs in at a terminal that 

5 signals his presence to the company LAN. 

6 b) receive a hand-held computer that, while presenting the visitor with useful 

7 information, will emit a constant signal enabling LEIA to pinpoint his position at 

8 all times. 

9 On the employee side, LEIA can acquire signals from: 

10 a) active badges 

1 1 b) telephones 

12 c) beepers 

1 3 d) interactions with terminals 

14 All sensors are connected directly to a LAN internal to the company. Only basic 

1 5 security measures need to be enacted. 
16 

17 Registered visitors will likely be profiled in standard contact files containing 

18 information about their own company, position, professional interests, previous 

19 interactions with the company they're visiting, and basic personal details (e.g., 

20 photograph, wife's name). 

21 Employees of the company using this LEIA-based system will have much more 

22 information available. In addition to the standard contact information, LEIA will have 

23 access to their phone and e-mail logs, daily calendar/schedule, work group and project 

24 details, position within the company hierarchy, and work-station notes (e.g., an 

25 unexpected "out to lunch" message could be posted on-screen by the employee before 

26 they run off). 

27 Although some higher-level intelligence is needed (e.g., for matching interest 

28 profiles between employees and visitors), the office-based system could well be run with 

29 a combination of a rule-base and optimization routine (which would be capable of 

30 coordinating conflicting schedules). Since this version of LEIA will be installed in a 
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1 fairly controlled environment, the problems of location and timing will be fairly trivial, 

2 the former solved by active badges, the latter solved by smart calendar agents. 

3 

4 For the visitor: 

5 If the visit is expected (and previously scheduled), LEIA will double-check the 

6 availability of the personnel to be visited and confirm or alter the schedule as needed. If 

7 the visit is unexpected, or certain employees are unavailable, LEIA will take into account 

8 the purpose of the visit and the visitor's professional interest profile to locate those 

9 available employees who would be best suited for a meeting. 

1 0 Once the visit schedule has been arranged (although it could be adaptively altered 

11 during the course of the visit), the visitor will be fed information via the hand-held 

12 computer they've been issued. This might give them a visual representation of the 

13 building's layout, along with the best route to the next office to be visited (a more 

14 advanced building might simply have computer displays in its corridors capable of 

15 flashing directional signals to the visitor as he passes them). In addition, it might give the 

16 visitor basic information about the person he's about to visit (for example, they might 

1 7 have interests in common, or have gone to the same university). 

1 8 For the employee to be visited: 

1 9 Whether or not the visit was previously scheduled, LEIA communicates with the 

20 employee's personal calendar agent and double-checks the employee's current location. 

21 If the employee is at a fixed location and has some free time, the visit is scheduled. 

22 Obviously, his connection to the visitor will determine what information, in addition to 

23 the standard contact data, should be pushed to his workstation. 
24 

25 For multiple employees requiring a meeting: 

26 Given that multiple employees have signed up for a meeting, or that a single 

27 employee has sent out a signal indicating the need for a meeting and its priority (standard 

28 or emergency), LEIA gets to work coordinating the details. Cell phone and beeper signals 

29 give employee locations outside the office, active badges signal employees already at the 

30 office. Calendar agents are contacted for availability; a standard meeting might depend 

3 1 on people having time slots free, whereas an emergency meeting would be given priority 
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1 status, and could bump items already scheduled (given that they have been assigned low 

2 priority). 

3 An optimal meeting time and place (or teleconnection for remote employees) is set up, 

4 and any pre-meeting information is forwarded to the appropriate workstations. 
5 

6 meeting, engaged in an important phone conversation 

7 Resolution Credentials Section 2.2.1 of this patent discloses the use of resolution 

8 credentials for the facilitation of agent-agent interactions. Resolution credentials enrich 

9 the quality of such interactions because they certify the status claimed by individuals, 

10 allowing agents to maintain relationships of trust. On a more practical level, resolution 

1 1 credentials are of use for matching and introducing users to each other on the basis of 

12 common interests, for guarding access to high-level users (with the agent acting as a gate- 

13 keeper), and for flagging times at which an individual is reachable (and not tied up in 

14 meetings, or the like, as specified by the scheduling agent).7.2 General Description 

1 5 We allow a user of SDI to push some data A to the data warehouse, and make this 

1 6 data available to all other agents via the SDI query-execution module, and to retain local 

1 7 control over other data B on the client machine. In this section we discuss this and other 

1 8 variations, and explain how the functionality of the central SDI data warehouse can be 

1 9 replicated on a client machine in a client-side SDI database, to allow a user to leverage 

20 the value in local data when interacting one-on-one with another agent. For example, a 

21 consumer's shopping agent might wish to use local data about its preferences to allow an 

22 online vendor to configure its services, but without releasing the information to the 

23 vendor. We explain how this can be supported within SDL 
24 

25 We allow the following variations: 

26 1. A user can store some information on his/her local client machine, with that 

27 information provided to other agents directly, or retained for querying by other 

28 agents on the local client machine as in the system of query-execution on the 

29 central SDI database. 
30 
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1 2. A user can use a light client machine, perhaps a mobile computing device, which 

2 is periodically configured with information stored on the central SDI database that 

3 is relevant to the current task. This information can then be used directly by other 

4 agents in interactions. 
5 

6 3. A client machine can be automatically configured with information about a user 

7 when the user is in its proximity, again to allow direct information exchange with 

8 a local agent. 
9 

10 4. The client machine can maintain all information, with virtual links from the 

1 1 central SDI data warehouse to the information. The system operates as before, 

1 2 with information provided if conditions are met, etc. 
13 

14 There are two basic motivations at work. One, a user might prefer to keep control 

1 5 over some information, even keep control away from a central SDI data warehouse, in 

16 case it is compromised in some way. We can allow a user to maintain data on a local 

1 7 client machine but seamlessly receive the same benefits as if the data was pushed to the 

1 8 central SDI data warehouse, with virtual links to the client machine. Query-execution can 

19 operate much as before, with data pulled from the client machine as necessary and 

20 charges levied. Operating overhead is incurred in such a system because data must be 

21 communicated over the Network during query execution. The client machine must also 

22 be "fat", with enough local data storage. 

23 A different motivation occurs when a user has a very light client, for example a 

24 mobile computing device such as a Palm computing device or a cell phone. In this 

25 variation the user will perhaps like to store profile information relevant to the profile that 

26 it assumes with another agent in a one-to-one interaction, but not have space to store all 

27 profiles. The role of the central SDI data warehouse is to configure the light client 

28 dynamically. 

29 With information on a user's client machine, then we can support local queries 

30 from other agents, for example on-line vendors might request information to allow an 

3 1 appropriate product to be displayed to a user, or a relevant advert. The queries can be 
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1 executed in a number of interesting ways. First, the information can simply be released to 

2 the agent, and the agent can make appropriate use of the information. Second, the 

3 information can be retained on the client machine, and the agent can submit its selection 

4 function, and receive the optimal selection, for example a good advert to show the user. 

5 One interesting implementation of such a client-side data warehouse is via smart 

6 stick technology, developed by Sony, which is a robust "data wand" that allows the 

7 storage of configurable data. 

8 7.3 Important Data Location Variations It is useful to identify four key modes of 

9 interaction between a pair of agents within the system of Secure Data Interchange. The 

10 agents might, for example, represent a consumer and a vendor in a B2C e-commerce 

1 1 application. The modes of interaction are adjusted to allow for different data locations. 

12 7.3.1 Data in SDI data warehouse In Figure 5 we illustrate the situation where agent 1 

13 has its personal information about the user stored in the central SDI data warehouse, and 

14 provides agent 2 with the ability to process a query on the data warehouse and access 

1 5 information about the agent. The query might either request information about the agent's 

16 profile, or provide a method to execute in the data warehouse, where the result of the 

1 7 method is an optimal action for Agent 2 to make in providing personalized information 

1 8 and/or services to the user with agent 1 . 

19 7.3.2 Data on the Client Machine (client-side SDI database) and in SDI data 

20 warehouse 

21 

22 Figure 6 shows an alternative variation where data is stored on an agent's client 

23 machine and in the SDI data warehouse, and the client-side SDI data warehouse can 

24 respond to queries from agent 2 in the same way as the central SDI data warehouse. In 

25 particular, as we suggest later in this document, this variation allows an agent 

26 representing an on-line consumer to store very personal information on a local machine, 

27 and for example only allow other agents to benefit from the results of analysis on that 

28 information without receiving the actual information. 

29 7.3.3 All information stored Client-side. Another variation on the basic SDI 

30 architecture, is that all information is stored only as virtual links within the SDI data 

3 1 warehouse, with physical storage on an agent's local client machine. Preferably, in order to 
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1 leverage the value of information the client machine would be always networked with the 

2 central SDI data warehouse. A current technical solution which seems appropriate would 

3 use a Set Top Box, connected via cable TV lines to the SDI network. Many households in 

4 the US have set top boxes, and this would provide a quite convenient way to leverage the 

5 value of information but keep it secure against inappropriate access in a central data 

6 warehouse. 

7 In this variation, to support a personalized session with another agent the data is 

8 stored locally, and can be processed using a client-side SDI data warehouse with a local 

9 query-execution module. To also support data mining by agents of a shared database, the 

10 client-side data records could useful push data types and price rules to the virtual shared 

1 1 database, to allow more efficient searching. This variation describes a distributed SDI data 

1 2 warehouse, with indexing performed over on-line client machines. 

13 8. Distributed Query Execution In addition to allowing distributed data location, the 

14 system of SDI allows distributed query execution, via client-side query-execution modules. 

15 The advantage with this variation is that data can be physically stored on trusted local 

1 6 machines close to a user of SDI, and the queries can be physically executed on the machines 

17 so that not even intermediate results are available outside of a local firewall protected 

18 system. There is no technical solution provided within SDI to prevent agents sharing 

19 information to other agents, other than to prevent agents from receiving that information in 

20 the first place. We do this using three many techniques: pseudonyms, information 

21 randomization (see Section 10), and query-execution in the SDI central server without 

22 release of raw data inputs. 

23 We can implement safe client-side query execution with direct replication of the 

24 key functionality of the query-execution and price-module of the central SDI data 

25 warehouse on a user's client machine. A querying agent can push the same method to an 

26 agent's client machine as it would submit to the central SDI query execution module, and 

27 an agent can execute the method locally and release information consistent with his/her 

28 data release rules. The SDI client-side agent can be configured, via the profile 

29 management and data-release policies, to control the type and amount of profile 
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1 information released to the central SDI data warehouse. We discuss this in some detail in 

2 Section 9, on "distributed data management". The challenge is to be careful that 

3 the results of a query do not reveal too much information, but this is solved using the 

4 same price rules as in the central SDI data warehouse. The mode of interaction supported 

5 is QUERY(PID), i.e. execute a query on a particular pseudonym ID, and the client side 

6 query execution module ensures that the vendor has a certificate from the client-side SDI 

7 agent to query the information in the local database about that pseudonym. The control 

8 over the profiles that a vendor can access, coupled with the price rules to ensure that 

9 information is perturbed as necessary and that not too much information is released, and 

1 0 to allow an agent to leverage the value of information. A user's agent can retain 

11 control over local information by limiting the information that can be released in 

12 response to a query. For example, although a query can be complex and access a lot of 

13 profile information on a user's client machine, we suggest an additional protection that 

14 controls the information that a vendor can receive for future use. It is possible to limit the 

15 response that the vendor receives to a fixed message size, e.g. to 10 bits for example, to 

16 provide a very strong overall control on the ability of a vendor to use information again in 

1 7 the future. 

18 8.1 Example: Customized Vendor Web Pages 

19 The method, called safe client-side query execution^ has a direct application to 

20 systems for personalized information delivery, where it is not desirable to release profile 

21 information to an information provider, but preferable to allow the information provider to 

22 provide methods (e.g. queries) that are executed on a client machine, and personalize 

23 information before it is provided to the end user. A key application of this technique is 

24 in B2C on-line e-commerce where a vendor can push methods for personalization of a 

25 virtual shop front to a user, and the user's SDI enabled client machine can implement the 

26 methods (which are queries) on the client, and push the result back to the vendor's server. 

27 The server then constructs and pushes an optimized virtual shop front to be displayed on a 

28 user's local display. At present advertising networks such as DoubleClick 

29 (www.doubleclick.com) are able to track a user as he/she browses across multiple sites in 

30 the DoubleClick network because the ad server can place a "cookie" text string on the hard 



Page 69 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 drive of a user's client computer which identifies that user as he/she browses. SDI is 

2 designed to be used on-line in combination with a method to block cookies, replacing them 

3 for example with "safe cookies" [Netscape 96] to still permit stateful interactions during a 

4 single session with a vendor, for example "shopping basket" style interfaces. The SDI-client 

5 side proxy agent can replace the role that cookies take in automatic user log-in and password 

6 checking, through a Janus/LPWA-style implementation [BGGMM 97; BGGMM98; 

7 GGMM98]. A user's log-in user name and password can be derived from a user's 

8 pseudonymous identity, and computed within a cryptographic framework. Janus also 

9 provides a technique to allow a user to receive e-mail pseudonymously. 

10 A core technique in SDI is to use a unique pseudonym for each vendor, and then 

1 1 selectively provide vendors with profile information from across multiple pseudonyms; 

12 either anonymously within the central SDI data warehouse for data mining purposes, or 

13 alternatively via client-side personalization so that a vendor can leverage a user's wide 

1 4 profile data without receiving direct access to that information. 

15 In providing profile information to a vendor during an interaction, so that the 

16 vendor can provide targeted products and servers to the user based on data mining that it 

17 has performed on aggregated user data in the central data warehouse, the user's agent 

1 8 must be careful to protect the identity of a user. This means that the agent must not reveal 

19 information under pseudonym P_l and information under pseudonym P_2 to another 

20 vendor that allows the vendors to link the pseudonyms and reason that the agent 

21 represents the same user. 

22 We describe in Section 1 0 the technique of random noise perturbation to release 

23 information in response to queries, and another simple technique is to carefully protect 

24 particularly sensitive information (e.g. social security numbers, etc.) 

25 8.2 Client-side Query ExecutionAs described earlier, a key variation of SDI retains a 

26 local data warehouse and query execution module that contains information specific to a 

27 single agent, on the client machine of that agent. It is then possible, via the same query 

28 execution controls as described in the central SDI data warehouse, to allow vendors to 

29 characterize and profile a user based on its local data, but without gaining explicit 

30 information about the user's local data. The vendor can push appropriate methods and 
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1 targeted services. We describe a number of bottom-level applications later in the patent, 

2 including personalized web pages, ad-networks, etc. 

3 We describe one variation in which the vendor provides generic information to the 

4 client-side device, that filters that information locally. This is relevant for example in high 

5 bandwidth information services such as digital television and satellite systems, where a 

6 user's set top box can store program information locally and make local decisions about 

7 what programs a user might like to watch and when, without providing any profile 

8 information to the head-end server. In this variation an agent can submit a query to be 

9 executed locally on a client-side SDI data proxy, that stores information submitted by the 

10 local agent. The applications of this technique are exciting, because it allows a user to 

1 1 maintain even more control over profile information but still benefit from personalized 

12 interactions with vendors. The client-side SDI data proxy just contains data for the local 

13 agent, and allows agents to submit 'personalization queries' of the type 'execute this 

14 profiling rule and tell me what product to show the user'. The outcome of the processing 

1 5 might be personalized information, for example a selection of books that a user might be 

1 6 particularly interested in based on its profile information and the methods passed to the 

1 7 agent by the agent that sends general information about the books in its catalogue. Figure 

1 8 7 illustrates the process. The SDI proxy agent maintains a set of profiles for a user, relating 

19 to transactions that the agent has performed with other agents in the system, and also other 

20 information that relates to the user associated with the agent. A vendor agent can send 

21 generic information and a method for personalization, which is executed as a query on the 

22 local profile data base, with the results allowing a vendor to decide how to target its products 

23 and services to a user. For example, a book retailer might be able to provide a selection of 

24 books that a user might be particularly interested in based on its profile information and the 

25 methods passed to the agent by the agent that sends general information about the books in 

26 its catalogue. We can provide rule templates, which are used to select the type of method 

27 that the provider of the information is requesting be applied to the generic information that is 

28 passed to the requestor agent. The provider agent only needs to provide parameters for the 

29 processing. For example, if collaborative filtering is the desired personalization technique 

30 then the providing agent provides a list of prototypical cluster centers, but does not need to 

31 specify a collaborative filtering algorithm. The interpreter takes the information and 
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1 methods from the providing agent and selects the appropriate rule template to form a 

2 program that is executed as a safe query in the CPU, with the usual controls over the amount 

3 of data that a query can access. 

4 The output from this processing is personalized information, that can then be displayed to 

5 the user, perhaps to a final test in the control module for applicability. 

6 8.2. 1 Preventing Information Leakage The challenge is to be careful not to allow the 

7 results of a query, even those data is not explicitly revealed to an agent that submits a query, 

8 to carry sensitive information that a user would rather not reveal. We allow this by: (a) 

9 partitioning a user's local profile into a number of different identities, so that a vendor can 

1 0 only access data for a particular profile; (b) allowing only "safe" information to be reported; 

11 (c) allowing only "safe" data fields to be accessed. One particularly simple technique is 

1 2 to limit the size of a response to a query by the number of bits, because this can quite nicely 

13 prevent the release of too much information, when coupled with close control over the type 

14 of data that is accessed. For example, a user might be willing to allow queries with any 

1 5 amount of "less sensitive" information, with the understanding that the vendor only receives 

16 at most 10 bits of information. As described above, this profiling might also be a 

1 7 sequential iterative process, so that the vendor does not need to transmit a compete decision 

18 tree up-front, but request a sequence of information. Only release information to a vendor 

19 that we select, and do not allow the same vendor to access too much information over an 

20 extended period of time. 

21 8.3 Extended Example: Managing Distributed Personalizationln this section 

22 we describe an extended application of distributed query execution, for a vendor that 

23 wants to customize its products and services to a particular user. A central application of 

24 SDI is a system where users submit profile information to the shared SDI data base, and 

25 receive personalized information and possible cash payments in return. This variation has 

26 users optionally submitting information to the central SDI database, and receiving 

27 personalized information based on statistical patterns that a vendor is able to assess from 

28 the information that is submitted by other agents. A vendor can provide a user with 

29 personalized content that reflects his/her interests based on the types of information that 

30 other agents with similar profiles like to receive. 
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2 Figure 8 provides a top-level view of one such agent, that we refer to in this section 

3 as a personalization agent. We consider an agent that maintains an identifier and (partial) 

4 profile information for the agents that it has interacted with. Consider, for example, email 

5 addresses of customers to Amazon.com. The vendor-side SDI proxy can maintain profiles 

6 for each user that it interacts with, indexed on the basis of the log-in identity that a user 

7 users. The agent can also use a "processor" device that takes data about the profile of a user, 

8 and the rules that have been collected based on profile information in its local database and 

9 from wider SDI query execution, and computes the personalized information that n agent 

1 0 might be most interested to receive. The personalization agent's information database 

1 1 (Figure 8) contains all information that the agent can provide to other agents, for example 

12 this would be all of today's news stories in the case of an agent representing the NewYork 

1 3 Times. The rules database contains methods that the agent can use to decide how to select 

14 information to present to a user based on profile information about the user. The profile 

15 information can be stored local to the personalization agent for the information that the 

1 6 agent has collected about other agents, other information will be stored either in a central 

1 7 SDI database, or client-side at the user's SDI proxy. The communication modules receives 

18 requests from information from agents, and information about agents' profiles, either from 

19 agents directly or from the central SDI server. The communication module also sends rules 

20 and profile information to other agents, requests for information to the central SDI server, 

21 etc. In Figure 9 we look inside the personalization agent's control module, which is the 

22 module that receives requests for information from the communication module, and checks 

23 whether the agent has the type of information that the agent requests, and then recovers the 

24 profile for the requesting agent and sends that to the processor. The processor selects a rule 

25 from the rule database to use in deciding what data to send to the requesting agent, and can 

26 also select a method from the rule database to accompany the data when it is sent to a 

27 requesting agent. 

28 The control module receives a request for information from the communications 

29 module, and checks with the information database for its capabilities to provide information. 

30 A query can also specify no particular information. The next step is to consult the profile 

3 1 database, and investigate whether the agent already has profile information about the user, 
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1 which it can used in the processor to select a rule that will decide which information to push 

2 to the agent. 

3 The processor returns a rule, and the control module consults the information 

4 database, and recovers the required information. Finally, the control module forwards the 

5 information, and optionally a method to further process the information at the requesting 

6 agent with profile information stored locally to the requesting agent. Figure 1 0 illustrates 

7 the role of the processor module in the personalization agent. The module receives the 

8 request for information from an agent, and any profile information that the personalization 

9 agent already has about that agent or that is provided with the information request. The 

10 processor consults the rule database, and selects a rule with which to decide what 

1 1 information to send to the agent. The processor can also decide to send a rule with the data, 

1 2 that will be processed on the client of the agent making the request for information based on 

1 3 local profile information to that agent. 

14 The decision module within the processor (Figure 10) selects a suitable rule, and 

15 decides what information to provide to the agent. This rule is passed to the threshold 

16 testing module. This is where a decision is taken about whether further customization 

1 7 should be performed local to the requesting agent, based on its local profile information. 

1 8 If the rule provides sufficiently well personalized information then the processor instructs 

19 the control module to process the information with the rule, and send the results back to 

20 the information gathering agent. Otherwise, for example in the case that the 

21 personalization agent does not have accurate profiling information about the requesting 

22 agent, the processor will instruct the control module to partially process the information 

23 as best as it can, and then send that processed information and an additional rule to the 

24 agent for additional processing. 

25 9. Data ManagementThe invention of SDI, in its most general form, refers to the system for 

26 a privacy-protected market for data interchange between multiple parties. In this section we 

27 discuss possible methods for an agent to manage its disclosure of data to the data 

28 interchange, and to other agents in the system. We allow agents to adopt different identities, 

29 and submit different information under different identities. The idea is that a pseudonym 

30 prevents other agents from exchanging information. Given an identity management policy, 
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1 the data management policy is then used to determine what data to submit to the central data 

2 server, or release to other agents, under a particular pseudonym. The agent needs to choose 

3 what data to release, and what price rules to associate with that data. We propose client-side 

4 "proxy agents" to intermediate the interactions between agents, and manage data provided to 

5 the central SDI data warehouse; and methods to manage an agent's identity in interactions 

6 with other agents, for example via pseudonymous and anonymous interactions. The proxy 

7 agents control the amount and types of information exchanged between agents. We refer to 

8 the proxy as "client-side", because it resides on the machine local to the agents that 

9 participate in the system of SDI. The client-level SDI proxy, implemented as a client 

10 program running on the user's client machine, manages all data release from the user to 

1 1 other agents and to the central SDI data warehouse. The proxy might usefully provide a 

1 2 rule-based interface to allow a user to select appropriate data management policies. 

13 An important application of SDI is to a system for business-to-consumer (B2C) e- 

14 commerce, where SDI allows individuals to provide vendors with access to profile information that is 

1 5 collected client-side about an individual as he/she interacts with vendors, but retain control over the amount 

1 6 and level of detail that is made available and collect payment for that information. A key function of the 

1 7 client-level proxy is to control the ability of multiple vendors to track a user, by control of the identity of a 

18 user in its interactions and the information released. The client-side proxy agent maintains profile 

1 9 information about a user, but releases that information according to a user-defined policy. The system of 

20 SDI, together with these client side proxy agents, allows a user to control the amount of information that is 

21 released to vendors and other agents, and allows users to exchange information within a privacy-protected 

22 and carefully controlled market place and receive personalized products and services The proxy might 

23 usefully provide a rule-based interface to allow a user to select appropriate data management policies. 

24 Vendors can run collaborative-filtering type applications on profile data from many online consumers, and 

25 use that information to personalize products and services in the future. Vendors can continue to build 

26 collaborative filtering models and other personalization models that used extended profiles, and users can 

27 receive the benefits of personalization without releasing profile information to a vendor; e.g. by 

28 personalization within the central SDI query-execution module or by personalization on a user's client 

29 machine. 

30 9. 1 Overview The basic assumption is that there is no technical method to prevent vendors 

3 1 from sharing information once they have that information, but it is possible to reduce the 

32 value of that information and the privacy implications for a user by making it very difficult 

33 to determine that two agents represent the same user in two different interactions. 
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1 The ability to interact pseudonymously with other agents, and adopt a unique 

2 (temporary or persistent) pseudonym for each agent-agent interaction, is not in itself novel. 

3 This is described in the literature, for example as early as the work of Chaum [Chaum 85], 

4 and others more recently [FR 98; Cranor 96; BGGMM 97; BGGMM 98]. What is novel is 

5 the central SDI privacy-protected marketplace, that respects metainformation that agents 

6 associate with data, to carefully control the availability of data to other agents. Also novel is 

7 the method of 'distributed query execution' (Section 8), that allows an agent to push a 

8 method to another agent, and take a decision based on private information without learning 

9 the information. 

10 In Section 9.2 we present example identity-management policies. At one extreme, if 

1 1 an agent interacts with every other agent under a unique pseudonym, and never allows two 

12 agents to match its identity based on information that it releases, then the other agents have 

1 3 no way of combining information about the agent. In the business-to-consumer e-commerce 

14 example, vendor A cannot share information about consumer Z with vendor B unless 

1 5 vendor B has some way of linking its customer with the customer of vendor A. This can be 

16 done at present via cookie mechanisms and advertising networks (e.g. Doubleclick), or 

17 simply by matching identifying information (e.g. user name, e-mail address, credit card 

1 8 number) across sites. 

19 In Section 9.3 we describe data-release policies, which are methods that determine 

20 what information is released for each pseudonym a user selects, and the rules associated 

21 with accessing that information. We also note that an agent can choose to introduce a level 

22 of noise-perturbation to data before its release, so that the data is still useful but cannot be 

23 used to link an agent across multiple pseudonyms via the data that it releases. Alternatively, 

24 an agent can release data with a reduced level of accuracy, for example within a range of 

25 values, and/or restrict the types of data that are released. Methods of data perturbation are 

26 discussed in more detail in Section 10. Finally, note that there are two parts to the 

27 data management policy. One relates to the data that a user releases to the central SDI data 

28 warehouse under a particular pseudonym, and another relates to the data that an agent 

29 releases to another agent in a direct agent-agent interaction. The following bullets 

30 summarize the distinction, and note the sections for more details. 
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1 o [9.3: Data Management Policy] Manage the Release of Data to the Central SDI 

2 data warehouse (with either physical uploading of data, or release of links to 

3 physically distributed data); this includes the association of price-rules with data 

4 that is provided to SDI. 
5 

6 o [9.4: Agent- Agent Data Management] Manage direct Interactions with other 

7 Agents, for example controlling the identities used in interactions, and the amount 

8 and type of information released. This is really a special-case of Sections 9.2 and 

9 9.3 for data release to the central SDI database. 
10 

1 1 In addition to the identity and data management policies, an extension of SDI 



12 could allow statements about "fair use" of information, much as is developed in the W3C 

13 P3P project [W3C; CR 98; RC 99], that allows a user to state for example whether or not 

14 it would like to receive electronic mail in response to information, whether the data can 

1 5 be exchanged with other agents, whether it can be used for personalization. "Fair use" 

1 6 policies are typically protected by "catch and punish" mechanisms, for example with spot 

1 7 checks by special interest groups on business practices, and not with technical solutions. 

1 8 In comparison, we focus on technical solutions to allow a user to control information held 

19 by other agents; for example by careful identity management, careful data management, 

20 and careful metadata annotations to control queries in the central SDI database. 

21 9.2 Identity-Management Policy Identity management policies allow an agent to 

22 assume pseudonymous or anonymous identities in its interaction with other agents, for 

23 example in the context of consumer-to-business commerce preventing two vendors from 

24 exchanging information about the same agent on the basis of the agent's revealed identity. 

25 An identity-management policy defines an appropriate identity (pseudonymous or 

26 anonymous) for different agents with which an agent might interact. 

27 A pseudonym can be unique for each agent-agent interaction pair, in which case 

28 another agent can store information about repeated interactions but cannot exchange that 

29 information with other agents on the basis of the pseudonym, because the pseudonym is 

30 unique. A stronger form of privacy is provided with an anonymous identity, that does not 
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1 allow even the same agent to track the agent across multiple interactions. An anonymous 

2 identity exists only for a single session — until either party agent changes context and 

3 connects to another agent. A pseudonymous identity system can be implemented using 

4 standard cryptographic techniques known in the art. For example, the JANUS/LPWA 

5 system [BGGMM 97; BGGMM98] allows a client-side proxy to compute a new pseudonym 

6 for an agent, and also a new e-mail address and user-name and password for that 

7 pseudonym, to allow access to web-pages that require user log-in. In particular, it is possible 

8 to compute new pseudonyms for a user without a central register of pseudonyms that could 

9 compromise a user's identity, through the technique of "blinded signatures" [Chaum 85]. 

10 The identity-management policy can be configured by a user when he/she first 

11 registers with the system of SDI; for example a simple policy might classify agents 

12 according to the certificates that they can present, and select a pseudonym according to the 

13 agent's class. 

1 4 Other reasonable identity-management policies include: 

15 o Absolute Privacy. At the highest level of privacy an agent interacts anonymously 

16 with every other agent so that agents cannot learn about the agent across sessions, 

17 and cannot personalize future interactions. Furthermore, information about the 

18 transaction with the agent, cannot violate a user's privacy when provided to other 

19 agents, so long as the agent does not release any identifying information during its 

20 transaction. 
21 

22 o High Privacy. At the next level of privacy, an agent interacts with every agent under 

23 a unique persistent pseudonymous identity. This prevents another agent exchanging 

24 information about the agent with other agents, unless the agent reveals other 

25 identifying information. However, this policy does allow agents to receive 

26 personalized information over multiple sessions with the same agent, directly 

27 without the vendor using information submitted to the central SDI data warehouse, 

28 or stored on a client-side single-user database. Another agent can personalize 

29 information, but only on the basis of previous transactions with the agent. In an 
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1 Internet environment this mode of interaction allows a vendor to track its customer 

2 across multiple sessions. 
3 

4 o Medium Privacy. Use the same persistent pseudonymous profile with groups of 

5 other agents, perhaps segmented according to the type of task that an agent is 

6 performing. With this level of privacy agents still control of the ability of other 

7 agents to build portfolios of information. Examples: (a) an agent might specify one 

8 identity for whenever it is purchasing books, so that any book seller agent that it 

9 approaches can access profile information that relates to the agent's previous book 

10 purchases and queries (so long as someone will provide/exchange/sell/rent that 

1 1 information to the agent); (b) an agent might require that a different pseudonym is 

1 2 used for business-related transactions and leisure-related transactions, to allow more 

1 3 precise personalization; (c) consider a family that interacts with other agents through 

14 the same SDI proxy. Each member of the family might choose to use a different 

1 5 pseudonymous persistent identity for all of his/her on-line transactions. 
16 

1 7 o Low Privacy. No attempt is made to protect the identity of an agent. This is the 

1 8 default privacy level provided via current Internet browsers, at least in the common 

19 usage of individuals. The system of Secure Data Interchange is useful with this level 

20 of privacy only to the extent of leveraging the value of information that has not been 

21 released to any agent. There is not technical method to prevent agents from 

22 exchanging information that relates to transactions they have performed with the 

23 same agent. 
24 

25 A general method to select a "medium" privacy strategy, with different pseudonyms 

26 for different agents, but some shared pseudonyms, is to suppose that agents present 

27 certificates. A certificate places an agent into a.particular class, and each class is associated 

28 with a particular pseudonym. Therefore agents that can position themselves in a particular 

29 class that is shared with other agents can share profile information; while an agent that can 

30 not position itself within a shared class will receive its own unique pseudonym. 
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1 A useful default policy is to suggest that an agent adopts a unique pseudonymous 

2 identifier for each vendor that they interaction, and then explicitly link pseudonyms as they 

3 decide, via a link-command to the central SDI database. The policy is a hybrid of the core 

4 suggestions above. Initially a user selects high privacy, but the user can choose to 

5 incrementally relax this privacy decision on the basis of continued interactions with agents. 

6 A user might also sell links between profiles, to agents that can present the right 

7 characteristics and pay the right price. 

8 

9 Example in Consumer-Business On-line Commerce. Figure 11 illustrates agents 1, 2 and 3; agent 1 

10 maintains one pseudonym for vendors 1 and 2, and a different pseudonym for vendors 3 and 4; agent 2 

1 1 maintains one pseudonym for vendors 3 and 6; agent 3 maintains a unique pseudonym for each of vendors 

12 1, 3 and 5. Of course, a vendor might adjust its level of service according to the level of 

1 3 identification provided by an agent. For example, a vendor might provide a more complete service to a user 

1 4 that provides a pseudonym than a user that provides an anonymous identifier, because a pseudonym allows 

15 a vendor to collect information over an extended period of time and build a detailed profile that can enable 

1 6 future personalization. 

1 7 9.3 Data-Release PolicyThe client-side SDI proxy provides support for 

1 8 automatic submission of profile data, and other types of data, to the SDI central data 

19 warehouse. This includes policies for pricing queries, and policies for introducing 

20 random perturbations to data. 

2 1 There are a number of human/computer interactions that are important in allowing a 

22 user to configure these options. . Many of the issues were anticipated in Cranor's work [CR 

23 98; RC 99], in which she describes methods adopted in the W3C P3P (platform for privacy 

24 preferences) project, and suggests that users can as a first step select a special interest group 

25 with which they are affiliated, use that as a base policy, and then adapt the policy as 

26 necessary. Alternative techniques include decision-tree methods as described in pending 

27 patent [INSERT US PATENT NUMBER AND NAME], that ask a user a number of 

28 questions in order to ascertain an appropriate service. There are many dimensions that a user 

29 might like to identify; for example: the type of information that can be released, the types of 

* 

30 queries that can be performed on that data, the price that must be paid to perform the 

3 1 queries. For example, we might hard code different price and data-release policy codes, e.g. 
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1 A B C D. . ., and allow third-parties to provide maps between data types and an appropriate 

2 policy code. 

3 We view an agent's complete data release policy as an intersection of its data 

4 management and identity management policies. As with identity management, a simple 

5 variation will define a fixed number of data management policies, stating types of data 

6 that can be released and conditions, and a mapping from agent certificates to policies. An 

7 agent can receive whatever policy it presents a certificate for. 

8 9.3. 1 Data Validation The SDI client-side proxy can also provide a useful service of 

9 validating data in profiles. This can be important in scenarios in which it is possible that a 

1 0 user can try to misstate information in the database for personal gain. For example, consider 

1 1 a consumer-business e-commerce system in which vendors determine user discounts for 

1 2 products based on their profile information. Clearly all users will attempt to adopt the profile 

13 for the cheapest price, if that is known, and if that is possible. We can allow vendors to 

1 4 specify that they will only follow validated information, for example if a user's profile states 

1 5 that he/she is very price sensitive, then it is more useful to know that if the assessment is 

1 6 based on validated transactions performed via the SDI system. In general terms, information 

1 7 is valuable within a marketplace if the information is accurate, and we can use client-side 

1 8 validation to achieve that goal. 

19 Consider another business-consumer e-commerce example. In general it is 

20 useful to allow an agent to duplicate information across different identities in the SDI 

21 data warehouse, e.g. my Zip code might well be information that I am very happy to 

22 associate with all of my profiles because it is not too revealing of my identity. However, 

23 as a vendor I might like to know that when I execute a query I am not paying to receive 

24 duplicated data records for the same user. One useful way to prevent this is to allow a 

25 user to also state with some records "this is the only user profile for which I am choosing 

26 to submit this piece of information". For example, if user A purchases book X then the 

27 user might choose to associate information about that book purchase with just one of 

28 his/her profiles in the SDI data warehouse, e.g. the profile for the "book-reader". Stating 

29 this allows a vendor to specify within its query that it is only interested in receiving 

30 information associated with profiles that include this "exclusivity" claim. The SDI 
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1 client-side proxy can provide a guarantee that the user only associates data with one 

2 profile, for example using a cryptographic method, e.g. signing the data record with a key 

3 to indicate that it is a unique record. 

4 9.3.2 Data Perturbation In addition to an agent's identity management and data- 

5 release policy an agent must be careful that it does not compromise its identity-management 

6 policy by releasing identifying information. For example, an agent must be careful not to 

7 release the same piece of information X under pseudonyms PI and P2 if it is unlikely that 

8 the information would relate to two different agents. In the next section we discuss the 

9 concept of data perturbation in some detail. The idea is that an agent must add enough noise 

1 0 to any information that is released to prevent identification by another agent, unless the other 

1 1 agent already knows the agent's true identity. This date-perturbation module can run on top 

12 of identity and data-release methods. Without careful management of the control of 

13 information a pseudonymous identity management policy is redundant. Remember, 

14 whenever pseudonymity is compromised there is no technical method to prevent vendors 

1 5 and other agents from exchanging information about the agent. 

1 6 Agents are careful to reveal only information that will not allow a vendor to link the 

1 7 identity of a user across multiple pseudonyms, defeating the identity management policy. 

1 8 Pseudonymity can be broken whenever an agent reveals the same piece of information, X, to 

19 multiple agents, e.g. A and B, and that information X has significant discriminative power. 

20 Agents A and B might be able to deduce with quite high probability that it is likely that the 

21 two pseudonyms refer to the same agent; and therefore combine their profile information 

22 and defeat the user's profile management policy. 

23 Consider an extreme example in business-to-consumer e-commerce: if agent 1 

24 reveals its social security number to vendors 1 and 3, its pseudonymity is broken. Vendors 1 

25 and 3 can now communicate and deduce that the agent that interacts under pseudonym PI 

26 and pseudonym P2 is the same agent, and therefore share profile information that the agent 

27 would like to isolate from each agent. The pseudonymity is broken in this case because 

28 social security numbers are unique identifiers. A slightly more subtle example is to consider 

29 an agent that provides a combination of profile information to two vendors, where the 
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1 information taken together is suitably revealing that the vendors can conclude that the 

2 pseudonyms relate to the same user with high probability. 

3 At the other extreme, we might simply provide no personal identifying information, 

4 such that in a business-consumer e-commerce application the vendor knows only the type of 

5 product that the agent is looking for, or the request for information made by the agent. In 

6 this situation we can use the methods disclosed in [SECTION XXX, END OF BOTTOM- 

7 LEVEL] to complete a transaction with pseudonymous physical mail and pseudonymous 

8 payments. 

9 A slightly more advanced method is to classify information as identifying and non- 

10 identifying, where information is non-identifying if no amount of the information can break 

1 1 a user's identity; e.g. information common to many other users, perhaps the city in which a 

12 user lives, a rough salary range for a user, a user's banking institution, etc. In comparison, 

1 3 identifying information might include a user's street address, a user's favorite book, a user's 

1 4 exact salary, etc. 

1 5 The data-perturbation approach is to select the information to release, via an agent's 

16 data-release policy, and then add a necessary amount of random noise to that information to 

17 protect a user's identity. We can add a small amount of random noise to data, enough to 

18 prevent identification, but not so much that the data has no value to another agent. For 

19 example, my year of birth and Zip code might be almost as valuable for customization 

20 purposes as is my full date of birth and street address. 

21 9.3.3 Click stream data. One type of data that is especially interesting in a browsing 

22 environment is click stream data, which is stored at the client machine and represents a 

23 sequence of clicks that a user has executed, possible across multiple vendors. The data can 

24 be stored on a user's local client machine and periodically released under an appropriate 

25 pseudonym to the central SDI data warehouse. The client-level proxy server that runs on a 

26 user's host machine is in a unique position of being able to monitor the user across different 

27 pseudonyms and across different vendors' sites. The client proxy might also collect 

28 information about: 
29 
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1 o The information that is displayed to a user (e.g. the text, the pictures, etc.) 

2 o Information typed at the keyboard, and profile information transferred from the 

3 client machine to a vendor. 
4 

5 The data is gathered by passively observing the actions of the user, and not by direct 

6 question-and-response. Possible click stream data release policies include the following 

7 data-release policies: 
8 

9 A. Release no information. 

10 B. Only release data on the URLs of the most recent sites visited. 

1 1 C. Release data about the URLs of the most recent sites visited, and the information 

1 2 displayed to the user. 

13 D. Release data about the URLs, the information displayed, and the information 

1 4 entered by the user. 

1 5 in combination with a suitable identity-management policy. For example, an agent might 

16 state that all click stream data should be released under the pseudonym that the agent adopts 

17 for a particular transaction, i.e. all click stream data with amazon.com should only be 

1 8 associated with my "buying books" pseudonym. An alternative policy might state that click 

1 9 stream data should be stored under a unique pseudonym, and not identified with any of a 

20 user's other pseudonyms. The agent might then allow a vendor to perform limited queries 

21 on that data, for example only query information related to particular domain names. Many 

22 variations are possible: e.g. use a unique pseudonym for each new URL domain, i.e. 

23 whenever a user skips to a new site, submit click stream data under a new pseudonym. 

24 Click stream data can be subject to random perturbation, just like standard data- for 

25 example removing time-stamp information and adding noise to the URLs that a user clicks. 

26 There may be other click stream data release policies worth considering outside 

27 the preferred embodiment discussed above. For example, Intermind's patent number 

28 XXXX, entitled XXXX, provides for the release of a user's information according to the 

29 terms and conditions of the user's own data disclosure policy. 
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1 Such an arrangement could be further enhanced by conditioning the 

2 quality of the data released by the vendor to the user on the strictness of the user's own 

3 data disclosure policy — users could then be rewarded to share more of their clickthrough 

4 data. Various levels of security could also be guaranteed to the user releasing such 

5 information - for example the data perturbation technique could be used to ensure that 

6 the vendor only receives aggregate information about his visitors. 

7 9.4 Agent- Agent Data Management It is important that an agent that submits data 

8 to the central SDI data warehouse also maintains careful control over the data that is directly 

9 released to other agents through one-to-one interactions. It is necessary to prevent a "black- 

10 market" in data. Furthermore, a user might simply prefer that another agent (for example 

1 1 representing a vendor) does not know certain pieces of information. With respect to 

12 preventing a black market in information about a user, it is important to prevent two 

1 3 different agents linking the identity of an agent under two different pseudonyms, based on 

14 comparing information provided by an agent under each pseudonym. We do not propose a 

1 5 technical solution to prevent these agents exchanging information (outside of SDI) about the 

16 agent when this type of linking is possible. Such violations can not only affect a user's 

17 privacy, but might also decrease the value of information provided to the SDI data 

18 warehouse, since information can be readily exchanged between agents outside of the 

1 9 constraints of SDI. Fundamentally, the policy under which an agent handles data release 

20 to another agent is no different from the policy with which data is released to the central SDI 

21 data exchange. This is a special case, in which the agent knows that it is providing explicit 

22 permission for an agent (specifically the agent with which it interacts) to receive all 

23 information that it releases. As indicate above, the pseudonym and the data release to 

24 another agent can be determined by certificates that the agent is able to present, and an 

25 agent's local data and identity management policies. 

26 For example, suppose that the policy defines that certificate A confers the right 

27 to receive a persistent pseudonym, and also receive links to the pseudonyms in set PL Suppose the 

28 certificate B confers the right to receive an anonymous pseudonym, and receive no links, and certificate C 

29 confers the right to receive a persistent pseudonym, and links to pseudonyms in set P2. Finally, suppose an 

30 anonymous profile is the default. Now, suppose a vendor presents certificates A and C. This vendor 

31 receives a persistent pseudonym, and links to pseudonyms in the union of sets PI and P2; a vendor that 
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1 presents A and B receives a persistent pseudonym and links to pseudonyms in set PI; and a vendor that 

2 presents no certificates receives an anonymous profile and no links to other pseudonyms. 

3 9.5 Light Clients: Web-Centric Data Management In this section we describe a 

4 variation that pushes data management to a centralized web-server, so that "light" mobile 

5 computing devices can achieve the same functionality as a more powerful desk-top 

6 computer. The convergence of telecommunication and computation technologies is leading 

7 to many new mobile devices, but they often have limited computational resources — 

8 processing power, memory, storage space, battery life-time. It is an interesting problem to 

9 provide these mobile devices with general computational abilities, for example leverage 

1 0 centralized computation to do such compute-intensive tasks as data mining and information 

1 1 extraction. 

12 We explain how the functionality of the client-side proxy agent, for example in 

1 3 managing the identity and profile information of a user as he/she interacts with other agents 

14 in a networked environment, can be pushed to a central server in the system of Secure Data 

15 Interchange. This is useful in the case of a light client with limited data storage and 

16 information processing capabilities. This enables a user to maintain a persistent SDI profile 

17 across different client machines, for example at work and at home. The profiles and 

18 pseudonyms that an agent uses to interact with other agents can be stored in a central 

19 computational server, with remote user access. Alternative technologies, such as smart-card 

20 technologies, might allow information about an agent to be physically stored with a mobile 

2 1 agent, but use central servers and local compute devices to perform computation. 

22 The first step is to support user-identification, for example via a mobile key-card 

23 [Chaum 85; Chaum 91], password and question-response, a voice-test, an iris-scan test, 

24 etc. With this, the client's proxy agent can authenticate its identity to the SDI profile 

25 management server, and request that the profile manager initialize a personalized 

26 interaction session with another agent. Consider for example a physical SDI device in a 

27 store, that requests notification whenever agents with light SDI clients are in its physical 

28 proximity. At this stage the light client and initiate an exchange of information between 

29 the SDI store agent and the SDI central servers, and receive the benefits of a personalized 

30 interaction. 
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1 Figure 12 illustrates a sequence of steps that can allow a light client to execute a 

2 customized interaction with another agent. Suppose agent 1 is a light client. Initially 

3 agent 2 contacts agent 1 , requesting profile information so that agent 2 can provide a 

4 customized service to agent 1 . Agent 1 receives the request, and contacts the SDI profile 

5 management server, which implements the functionality of an agent's data management 

6 and identity management policies, and provides an identifier for the appropriate agent 

7 pseudonym to agent 2. Agent 2 can then request information about agent 1 directly from 

8 the central SDI server, and finally provide personalized information and services to agent 

9 1. 

10 The pending patent application "A System for Location Enhanced ..." [Patent 

1 1 Application #09/3 14,32 1 ] describes a method to identify a user based on location, which can 

12 allow automatic detection of close SDI-enabled agents based on physical proximity. The 

13 methods taught in the LEIA patent also suggest the use of a common user identifier, which 

14 could via simple look-up act as the Unique User identifier in the central SDI database. Other 

1 5 biometric variations, e.g. via Iris scan technology promise to allow personalized interactions 

16 between people and other agents that they approach in the physical world, with sessions 

1 7 automatically configured based on identification and principles of consent. Location based 

1 8 filtering can be further used to allow relevant information to be delivered to a user based 

19 upon his/her physical location. It is also possible to pre-cache personalized information 

20 directly to the user's client (and server) thus overcoming the bandwidth bottleneck or 

2 1 wireless connections. 

22 Web-centric profile management can be used in an environment of mobile 

23 users that interact with many different computational devices, for example ATM 

24 machines, point-of-sale terminals, etc. A simple variation of SDI allows a user to allow a 

25 machine with which he/she interacts to be automatically configured for a personal 

26 session, based on carefully controlled queries from that machine to the central SDI data 

27 warehouse. A common 6t user identifier" be it cards, codes or biometrics can be used to 

28 identify and personalize the local SDI client machine. The user may also be identified 

29 during off-line transaction via his/her credit or debit cards, for example on ATM 

30 machines and POS kiosks, providing promotional offers and coupons. 
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1 9.6 Smart Cookies 

2 Current practice in web-browser based consumer-to-business electronic commerce is 

3 to use cookies, which are identifiers placed on a user's hard drive, to identify a user across 

4 an extended period of time. For example, if I access the New York Times from my home 

5 personal computer on Monday, and then again on Tuesday, the New York Times server can 

6 identify that I am the same individual and build a profile of my interests, i.e. the new stories 

7 which I choose to receive first each day. This type of information about all of the users that 

8 read the Times can allow collaborative-filtering type techniques and personalization of 

9 information in the future, such that my "front page" is different from the front page of 

1 0 someone else. 

1 1 However, cookies have the unfortunate side effect of allowing an individual to be 

12 tracked across the web pages of different vendors, for example across the web page of the 

13 New York Times ( www.newvorktimes.com ) and Amazon ( www.amazon.com ), if the 

14 Times and Amazon both embed content from the same third party in their pages. This 

15 happens, for example, with the DoubleClick advertising network. Doubleclick 

16 (www.doubleclick.com) operate a virtual network of pages, and can track a user across any 

17 page within their network, and gather a very comprehensive user profile. Although the 

1 8 cookie mechanism is designed so that only vendors with the same domain name can access 

19 cookies on a user's hard drive, they can easily be used to profile users across multiple 

20 vendors, for example with a double-click style network that embeds a universal advert 

2 1 server within each page. 

22 The system of SDI allows controlled personalization, such that a proxy-automated 

23 log-in session where the proxy presents a user's pseudonym to a vendor allows that vendor 

24 to track a user over time at its own web site, but a vendor cannot track a user across web 

25 sites. The SDI client-side proxy agent will disable cookies in their current form (although 

26 introducing "smart cookies" to retain the initial functionality provided by cookies in 

27 Netscape's description, that of a stateful interaction with a user). Furthermore, vendors can 

28 still perform collaborative- filtering style data-mining across different data sets from multiple 

29 vendors, but only with the explicit authorization of owners of that information, as presented 

30 by price-rules associated with that data in the main data warehouse. 
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1 Netscape, Inc. [see Netscape.com technical notes for developers] have proposed a 

2 method called "safe cookies" to retain the stateful interactions that are an essential part of a 

3 good e-commerce interaction (supporting shopping baskets, for example), but without the 

4 drawback of tracking users. Our smart cookie mechanism is similar, allowing stateful HTTP 

5 sessions without allowing a user to be profiled across multiple sessions. In its basic form 

6 HTTP is anonymous, there is no way to differentiate one request for information from 

7 another request for information. The cookie mechanism was designed to overcome this 

8 problem when it is useful to display consistent information to a user over a sequence of 

9 "clicks", for example to implement a "shopping basket" in a on-line e-commerce site. An 

1 0 alternative mechanism is to embed codes in the URL itself, with the "+ code" mechanism. 

1 1 We propose a smart cookie, that is stored on a user's hard drive and provided with every 

12 HTTP mechanism sent to a vendor, but deleted as soon as the user changes his/her 

13 current context and goes to another URL domain name. For example, the New York 

14 Times can write a smart cookie to the user's hard drive to associate that user with HTTP 

1 5 requests while the user does not change context and go to another web site. However, as 

16 soon as the user enters the amazon.com URL into his/her browser the cookie is 

17 permanently deleted from the user's hard drive. Essentially, the solution must allow only 

18 temporary cookies, which expire at the end of a "session", as defined by the client 

19 machine. The temporary cookies provide stateful interactions in a TCP/IP system as with 

20 cookies, but expire at the end of a session. 

21 Cookies are also used at present to allow a user to automatically and seamlessly 

22 log-in to a web page. We can avoid any loss of convenience to a user with automatic 

23 methods for log-in to remote servers, for example via the Janus/LPWA [BGGMM97; 

24 BGGMM98;GGMM98] or W3C proposals [CR98; RC99; W3C-OPS 97]; essentially the 

25 pseudonym and an associated password that a user adopts for a vendor allows log-in to 

26 that vendor. 

27 Also important when supporting a system that allows a user to browse pseudonymously is that 

28 the physical attributes of a network system are removed of their identifying characteristics. For example, 

29 another role of client-side data management agents is to strip the 'from' field in a HTTP/TCP message. 

30 9.7 Implementation Details In an Internet browser environment the client-side proxy 

3 1 agent that provides distributed data management for an agent might be implemented as a 



Page 89 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 plug-in into the browser, that can for example be downloaded from a central SDI server. The 

2 browser is then configured to use the SDI proxy as its proxy, and the SDI proxy itself 

3 connects through a user's ISP (or other intranet gateway) to the Internet, and on to other 

4 vendors. 

5 The user provides his/her SDI proxy with personal information, such as his/her 

6 name, mailing address, and e-mail address. The client-level proxy registers then registers the 

7 user with the central SDI server, providing the server with the name, address and e-mail 

8 address of the user. Other basic user information might include demographic information, 

9 for example a users job, marital status etc. The client proceeds to automatically generate a 

1 0 unique SDI user ID code, and a private key to allow future authentication of its log-in. 

1 1 The client can create a unique public key/private key pair. This key pair can be 

1 2 generated only once for a person, and although the central SDI user ID server does not know 

1 3 the key pair, the server can verify that a key pair is only generated once- because a new user 

14 must present proof of identity to establish an account. The client generates a unique user 

15 identifier, UUID, for example with the methods taught in [Chaum 85; Schneier 92]. The 

16 UUID can then be blinded and signed to certify that a user is registered with SDI, using 

1 7 Chaum's technique of blinded signatures so that the certifying agent does not the identity of 

18 the UUID that it signs [Chaum 85, Chaum 92]. The client-level proxy can remove the 

19 blinding factor, to compute a signed UUID that it uses when it is necessary to generate new 

20 pseudonyms and request new certificates. 

21 The client-level proxy can now sign messages with its private-key, and provide the 

22 signed to UUID, to verify that (1) the UUID represents a validated user; (2) it is the client- 

23 level proxy authorized to act for the user, because it has the private-key associated with the 

24 UUID. The client-level SDI proxy uses the private key to authenticate messages that it sends 

25 to other modules within SDI, such as Pseudonym administering servers. The unique user ID 

26 for a user does not carry any information about the user, its sole purpose is to provide a 

27 unique identity. The unique UUID can also be used to generate new pseudonyms for users 

28 that are certified one-time for a particular vendor, so that the user certifies that he/she 

29 maintains the same profile for all interactions with a vendor. 
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1 At this stage the central SDI server might verify the identity of the user, and also 

2 check that the user is not already registered with SDI. The method for verifying the identity 

3 of a user could include requesting that the user provides his/her social security number, or 

4 some other institutional solution that is used for this purpose. In the future we could 

5 envisage an electronic system for such an identity procedure, but the method might require 

6 for the user to execute this initial step in person with the presentation of a recognized photo 

7 ID. The central SDI user ID server maintains a database of all users that are registered with 

8 SDI, and checks that that the user is not already registered with the system of secure data 

9 interchange. The central SDI proxy may also provides the user with a signed certificate of 

10 some universal identifier, such as its Social Security Number, that the user can use to 

1 1 generate other certificates from certifying agencies, to be able to gain certificates under 

1 2 pseudonymous identities. 

13 9.7.1 Initializing Identity and Data Management PoliciesThe client-side SDI 

14 proxy might provide a rule-based interface to allow a user to select from a menu of 

15 defaults an appropriate profile management policy, or to configure with the aid of a 

1 6 decision tree an appropriate profile management policy for a user. Profile management 

17 policies define how a user will interact with various classes of vendors (depending on the 

18 nature of the business that the vendor is engaged in), the kinds of uses to which the 

19 transactional information that a vendor collects can be put to, and the amount of 

20 information that a vendor which collects profile information about a user is authorized to 

21 release. The client-level proxy manages a user's interactions with vendors, to keep them 

22 within desired policies. 

23 9.7.2 Generating a New Pseudonym The method of blinded signatures [Chaum 85; 

24 Chaum 92] provides a useful technique to generate new pseudonyms for a user, without any 

25 centralized database that stores pseudonyms. The client-level SDI proxy can generate a new 

26 identifier, comprised of a sequence of bits, that will be unique with a high degree of 

27 probability. The identifier is "blinded" and then submitted to a trusted-third party to be 

28 authenticated for use as a pseudonymous identifier. When authenticated the blinding factor 

29 is removed, and the final signed pseudonymous ID can be used as a new identifier. The 

30 signature can allow a user to associate certificates with that identifier, and also to verify that 

3 1 the user has only a single pseudonymous ID with a particular vendor. 
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1 Pseudonym administering authorities (PAS) cannot build dossiers of the 

2 pseudonyms, because users submit "blinded" identifiers. The only information that a PAS 

3 has is the list of unique vendors that a particular user has registered with. The pseudonym 

4 administering server can be operated by an agent with a trusted relationship with a particular 

5 vendor. 

6 Every SDI user has a unique identifier, a UUID, that is presented to the PAS with 

7 the new pseudonymous identifier to be validated. The PAS can verify that this is the first 

8 pseudonym for a particular vendor. Each Pseudonym administering server has a public key / 

9 private key pair (PKPAS, SKPAS) for each Vendor for which it validates new pseudonyms. 

10 A signed pseudonymous identifier, signed with the private key of PAS, verifies that the 

1 1 pseudonym is valid. Cryptographic techniques ensure that the signature cannot be falsified, 

1 2 and allow another party to verify the signature with the public key of the PAS. This enables 

1 3 a vendor that receives the pseudonym to validate that the pseudonym is unique for the user, 

14 to enable persistent interactions across multiple sessions. 

1 5 An agent can use its pseudonymous identity to initialize transactions with a vendor, 

16 and also to encrypt messages. However, it may be more efficient to exchange future 

17 messages with a shared key pair — this is more efficient to implement that an asynchronous 

1 8 key pair cryptographic solution. Messages can be encrypted with the shared key, that only 

19 the user and the vendor know. This (1) validates that the message is from the sender; (2) 

20 ensures that only the intended recipient can read the message.9.7.3 Automatic Client-Side 

21 Profile Management When a user clicks to a new URL the SDI proxy agent must 

22 determine (a) the identity to adopt with the vendor; and (b) the profile-management policy to 

23 adopt with the vendor. The first-step can be performed by local look-up, in a stored table of 

24 the vendors that the user has interacted with. Should the vendor be a new vendor, then it is 

25 necessary to implement the identity-management policy based on certificates provided by 

26 the vendor, and then either use an existing pseudonym or alternatively adopt a new 

27 pseudonym. 

28 With an appropriate pseudonymous identifier the proxy agent can connect to the 

29 vendor, sending the vendor its signed identifier to identify itself. The proxy server continues 
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1 by sending the data privacy policy for the user with this vendor, signed with the private key 

2 for the pseudonym. This serves three main purposes: (1) it demonstrates to the vendor that 

3 the user is indeed the owner of the public key PKP, because the user has the secret key that 

4 is associated with it; (2) it informs the vendor about the data-privacy policies that the user 

5 requires; (3) whenever the vendor submits information about transactions with this user to 

6 the central SDI server it must also submit this certificate to verify that it is following the 

7 user's guidelines. 

8 The client-side SDI proxy agent might also provide a "connection-certificate" to 

9 provide the vendor with certain abilities to access information about the user in the central 

10 SDI data warehouse, as described in the earlier section. When a user connects to a site and 

1 1 provides a certified public key, the first-level proxy server also provides a time-stamped 

12 certificate of connection, S( (PK*V, T), SKP), where T is the current time, PK*V is the 

1 3 unique public key of the vendor, and SKP is the secret key of the user for the pseudonym 

14 that it uses with the vendor. This "connection certificate" is used by the vendor to request a 

1 5 profile-release from the central SDI server. 

1 6 The proxy agent than implements the profile-management policy, as it relates to the certificates 

17 presented by the vendor. First, the agent might submit the basic profile information to the vendor, for 

1 8 example a user's age, nationality, state, sex — anything a user is happy for any vendor to know. This is the 

19 basic profile that is configured by the user during initial registration with SDL Later, the vendor might 

20 request profile information, that can be provided if the vendor is authorized to receive such information. 

21 9.7.4 Leveraging Existing Standards The architectural framework outline above 

22 can be implemented with a number of existing technical methods. One approach is to use 

23 the Extensible Markup Language (W3C-XML) to encode information exchanged between 

24 client agents and vendor agents, for example profile information and requests for 

25 information. This has been suggested by the W3C consortium. The extensible Markup 

26 Language (XML) proposal of the Worldwide Web Consortium working group on SGML 

27 provides an ideal standard for representing such information. XML allows meta-content to 

28 be included with documents, machine-readable information that enables documents to be 

29 processed by client software. Augmenting web documents with structured information 

30 enables clients to respond to requests for profile information. XML can represent rich data 
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1 structures, and that allows a grammar to be defined for information that allows data to be 

2 automatically verified for correctness. 

3 Example: An XML Representation for a Profile Request Message 

4 A vendor might request profile information from a client-side SDI proxy agent. A 

5 vendor sends a 'Request Profile' message, and the client-side SDI proxy agent responds 

6 with a 'Profile Update' message, that contains profile information, in accordance with a 

7 user's profile management policy. The request-response mechanism can be implemented 

8 using the standard HTTP Post/Response mechanism in conjunction with XML message 

9 types. The 'Request Profile' message can be represented in XML as: 
10 

11 <?XML version = "1.0" ?> 

12 <?xml : namespace ns = "http://www.sdi.com" prefix = "SDI" 

13 ?> 

14 <!doc> 

15 <SDI : Request > http://www.some_vendor.com </SDI : Request > 
16 

1 7 and the SDI proxy agent's 'Profile Response' message can be represented in XML as: 
18 

19 <?XML version = "1.0" ?> 

20 <?xml : namespace ns = "http://www.sdi.com/clientX12345" 

21 prefix = "SDI" ?> 

22 <!doc> 

23 <SDI:Update> 

24 <SDI:Profile> 

25 <SDI:Item> (1231, 0.453) </SDI:Item> 

26 <SDI:Item> (1041, 0.034) </SDI:Item> 

27 </SDI:Profile> 

28 </SDI:Update> 
29 

30 An illustrative Document Data Type (DTD) for an SDLProfile element type is presented in 

31 the next section. The XML messages are included in the body of standard HTTP 
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1 Post/Response messages. We limit the performance degradation caused by out-of-date 

2 profile information that is stored within web pages of on-line vendors by associating "out- 

3 of-date" time stamps with the profiles that are provided by the central SDI server. This 

4 mechanism is similar to the "expiration time" tag of a Netscape Cookie message. The 

5 frequency with which profile updates need to occur will depend on the speed with which 

6 profile information changes. The "out-of-date" time stamp can be included as an additional 

7 element in an SDI:Update message. 

8 The system as outlined above can be implemented within the current HyperText 

9 Transfer Protocol (HTTP), as a sequence of challenge/response pairs between clients and 

1 0 servers. The HTTP Post/Response mechanism allows clients and servers to exchange data, 

1 1 and this data can be an instance of an XML Document Type, within the body of a HTTP 

1 2 message. The HTTP protocol is the underlying mechanism, with SDI messages contained 

13 in the body of the HTTP Post and HTTP Response as XML documents. 

14 

1 5 Example: A Possible XML Representation of a User profile 

16 The World Wide Web Consortium (W3C) SGML working group developed XML 

1 7 (extensible markup language) to provide an open and extensible grammar for structured data 

1 8 [XML]. An XML document has an associated schema definition to enable an XML-enabled 

19 browser to validate the structure of XML data automatically. A Schema in XML is called a 

20 Document Type Definition (DTD), and defines the names of tags, their structure, and their 

21 content model. XML allows the DTD for an XML file to be identified through a Universal 

22 Resource Indicator [URI] in the header of the file (see below). XML also allows URIs for 

23 mobile code resources to be referenced, in order to enable a client to process embedded 

24 XML data. An XML document must be well formed, and in order to be well formed the tags 

25 must form a tree structure. In addition, the DTD allows the structure of an XML document 

26 (an instance) to be validated against a particular schema. Senders and receivers must only 

27 send valid SDI files. Each SDI message is a valid XML document. 

28 We provide an example XML instance and part of a Document Type Definition for 

29 use within the system of SDI. We assume in this example that profile information is 
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1 represented as a list of attribute-value pairs within an XML document. An attribute is 

2 defined by a numeric code, and the value defines the weight of the attribute. For example: 
3 

4 <?XML version = "1.0" ?> 

5 <?xml : namespace ns = "http : //www. w3 . org/OPS/OPS" prefix = 

6 "OPS" ?> 

7 <?xml : namespace ns = "http : //www. sdi . com" prefix = "SDI" 

8 ?> 

9 <!doc> 

10 <SDI : Prof ileData> 

11 <SDI :Location> 

12 <SDI :Geocode> 12321561 </SDI : Geocode> 

13 <SDI :DigiMap> http: //www. digimap/712321561 

14 </SDI :DigiMap> 

15 <OPS:Zip> 19103 <SDI : /Zip> 

16 <SDI : /Location> 

17 <OPS :Demographic> 

18 <OPS: Gender > F < /OPS : Gender > 

19 <OPS:Age> 26 </OPS:Age> 

20 <OPS:Income> 50000-75000 </OPS : Income> 

21 </OPS :Demographic> 

22 <SDI:ID> 

23 <SDI : Pseudonym> P12543 </SDI : Pseudonym> 

24 <SDI:PublicKey> 12453246129421 </SDI : PublicKey> 

25 </SDI:ID> 

26 <SDI:Profile> 

27 <SDI :Prof ile-item> (1242, 0.546) </SDI : Prof ile-item> 

28 <SDI :Profile-item> (56, 0.045) </SDI : Prof ile-item> 

29 </SDI:Profile> 

30 </SDI : Prof ileData> 

3 1 The Document Type Definitions for this document are specified in the header, and 

32 include URIs to a DTD of the Open Profiling Proposal of the W3C, and also a DTD of the 
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1 Secure Data Interchange. The OPS DTD is used to boot strap the SDI DTD, providing tags 

2 for common profile information, such as 'Gender', 'Age', 'Income', etc. The section of the 

3 SDI Document Type Definition that is used in the above XML fragment is presented below. 

4 It makes reference to tags defined in the OPS DTD, and the RDF (Resource Description 

5 Framework), a W3C proposal to standardize the structure of Digital Type Definitions for 

6 XML documents. XML Name spaces [NS] provide a method for unambiguously 

7 identifying the semantics and conventions governing the particular use of property-types by 

8 uniquely identifying the governing authority of the vocabulary, for example OPS and SDI in 

9 the example above. The URI for a schema can contain a human and machine-readable 
1 0 description of an XML schema. 

11 

12 <! ELEMENT SDI : Prof ileData (SDI : Location? , 

13 OPS: Demographic?, SDI: ID?, SDI : Prof ile?) > 

14 <! ELEMENT SDI: Location (SDI : Geocode? , SDI:DigiMap, 

15 OPS: Zip?, OPS: Address?) > 

16 < 1 ELEMENT SDI : ID (OPS: Name?, SDI : PublicKey? , 

17 SDI : Pseudonym? ) > 

18 <! ELEMENT Profile RDF : list<SDI : Prof ile-item> > 

19 < I ELEMENT SDI:Geocode # PCDATA > 

20 <! ELEMENT SDI:Digimap #URI > 

21 <! ELEMENT SDI : PubicKey # PCDATA > 

22 <! ELEMENT SDI : Pseudonym # PCDATA > 

23 <! ELEMENT SDI : Prof ile-item (SDI : Attribute-ID, 

24 SDI : Attribute-value) > 

25 < ! ELEMENT SDI : Attribute-ID # PCDATA > 

26 <! ELEMENT SDI : Attribte-value # PCDATA > 
27 

28 The tag '#PCDATA' is used here to represent numeric or textual information, '#URT 

29 declares that an instance of element 4 SDI:Digimap' must be a valid URI pointer. 

30 There are a number of other related W3C proposals. The W3C proposal [W3C- 

31 MCF] for a Meta Content Framework (MCF) suggests a particular structure for the 
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1 description language for web pages, to enable schema to be shared and re-used. This 

2 proposal is incorporated into the W3C [W3C-RDF] Resource Description Format standard 

3 (RDF). The W3C proposal [W3C-OPS] for an Open Profiling Standard (OPS) describes a 

4 system for profile exchange between two parties, building on XML and MIME standards. 

5 The W3C proposal on privacy and profiling [W3C-Privacy] on the Web extends the vCard 

6 [Vcard] schema for electronic business cards to include profile information, and suggests 

7 that profile information can be stored and managed locally, with Client-server exchange of 

8 personal information as required (using the HTTP challenge/response mechanism). The 

9 Resource Description Framework (RDF) enables the encoding, exchange, and reuse of 

10 structured metadata. RDF is an application of XML, with additional constraints to allow for 

11 Data Type Definitions (DTDs) to be published, and interchangeability across different 

1 2 communities. The ability to standardize the declaration of vocabularies will encourage the 

13 reuse and extension of semantics among different information communities [Mil 98]. RDF 

14 is a W3C proposed standard for defining the architecture necessary for supporting web 

15 metadata. RDF is an application of XML that imposes structural constraints to provide 

16 unambiguous methods of expressing semantics for the consistent encoding, exchange, and 

1 7 machine processing of metadata. RDF additionally provides means for publishing both a 

18 human-readable and machine-readable vocabularies designed to encourage the exchange, 

1 9 use and extension of metadata semantics among disparate information communities 

20 10. Methods for Data Perturbationln this section we introduce data perturbation methods 

21 to augment data values with randomized noise, to allow queries to tradeoff price and 

22 quality, and to allow a user to protect his/her identity by hiding revealing details of 

23 submitted data elements. 

24 The randomization of data occurs in two places in SDI, first a client-side SDI agent can be 

25 programmed to add noise to data as it is submitted to SDI, to provide an absolute guarantee 

26 that no malicious party, even for example an employee of SDI with privileged access to the 

27 SDI data warehouse, can benefit from highly sensitive information. Second, the data 

28 repository in the central SDI data warehouse can be programmed via price-rules for access 

29 to data records to allow randomized information to be released in response to queries, 

30 perhaps for a cheaper price. 
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1 The method of random data perturbation allows an agent to protect the 

2 pseudonymity of profile information which is released within the system of Secure. Data 

3 Interchange in response to queries executed on the central SDI data warehouse, and also 

4 with respect to information that is provided to other agents (e.g. from consumer client 

5 machines to vendors' servers in e-commerce applications) during direct interactions. 

6 Information is either adjusted slightly, through the addition of small amounts of noise, or 

7 rounded or binned in some way — to protect the identity of an agent if it is also releasing the 

8 same information under other pseudonyms. The goal in the method of data perturbation is to 

9 only adjust data by as much as necessary to protect the privacy of an agent, so that the value 

1 0 of data for personalization and user-profiling is retained. 

11 As an example, suppose a user lists its last 10 compact disk purchases, their prices, 

12 and the dates of purchases to two different on-line music vendors. Suppose the agent uses 

13 different pseudonyms with each vendor. Given information about the distribution of CD 



14 purchases over the population of agents, the vendors can compute the probability that the 

1 5 pseudonyms relate to the same agent. If it is quite likely that it is the same agent, the vendors 

1 6 can now exchange information about that user, and even collude on price. Clearly, the more 

17 revealing the information (e.g. providing more information, location of purchase, last 4 

1 8 digits of credit card, etc.), the greater the chance of identifying a user. 



19 However, the system of SDI is built around the concept of gaining value from the 

20 release of information, and maintaining careful control over the particular information 

21 available to other agents. The method of random perturbation allows information to be 

22 released without losing control over the ability of another agent to link the information with 

23 other pseudonyms that an agent might use. 

24 The main focus here is on random data perturbation, or rounding etc. as a method to 

25 prevent the identification of an agent and the linking of pseudonyms. This is critical to the 

26 successful operation of a system of pseudonymous data exchange, such as that implemented 

27 within SDI. 

28 Figure 13 illustrates how an agent's price rule can usefully contain an additive cost 

29 to reflect an agent's preferences to protect its protection level during query execution. For 
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1 example, highly sensitive information can have a high price, while other information can 

2 have a low price. This additive cost to account for an agent's protection against the linking 

3 of its pseudonyms can be universal across all data records. 

4 We propose a technique to allow a user to control information released and secure a 

5 certain protection level, which is an indication of the probability that its true identity can be 

6 recognized from information that it releases by an adversary with a perfect copy of the 

7 agent's true profile. The randomization method adds a level of noise to information to 

8 provide protection, based on the amount of information that is released and the information 

9 that is present in a database because of data records submitted by other agents. This is 

10 illustrated in Figure 14, which shows qualitatively that if there are many data records then 

1 1 the agent needs to add less noise to achieve the same level of protection as with less data 

12 records. 

1 3 The method of random data perturbation adds noise to data records to prevent an 

14 adversary with accurate information about the values of certain fields in the data record 

1 5 under another pseudonym from linking the pseudonyms, and reasoning that the two users 

1 6 are the same. Alternatively, we can round or bid data values, this has a similar effect. 

17 Basically, we add enough random noise to make the record sufficiently 

1 8 indistinguishable from other data records. There are two modes of protection, that depend on 

1 9 the amount of information known by the adversary: 

20 A. [Strong Protection] Assume that the adversary knows that an agent A_l with 

21 data records P submits data into a database; i.e. assume that the adversary 

22 knows for sure that one of the data records in the database relates to agent 

23 A_l. 
24 

25 Strong protection is only possible if there are enough data records from other 

26 agents to allow agent A_l to add noise to its own data and make it sufficiently 

27 like that one of the other data records is the data of A_l; in particular the other 

28 data records must also be submitted with random noise perturbation, or rounded, 

29 such that it is possible that other data records are submitted by agent A_L 
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1 

2 B. [Weak Protection] Assume that the adversary does not know that the agent 

3 A_l with data records P has definitely submitted data into the database. 
4 

5 Weak protection is easier to achieve. It is only necessary to add enough random 

6 perturbation to data to make the number of possible data records over the 

7 population of possible data records that are supported with the perturbed data 

8 record large enough to prevent agent identification. 
9 

1 0 Randomized data is still useful within SDI for data mining and other applications, so 

1 1 long as the amount of noise which is added to records is small in comparison to the value of 

1 2 a record. For example, we can still perform correlation across fields with randomization so 

1 3 long as the randomization does not destroy any trends between fields. Randomized data is 

14 marked as such within SDI, and labeled with the degree of degradation, so that SDI can be 

1 5 aware of the number of records to get relevant accuracy levels, and can report accuracy to 

16 customers. There is a tradeoff between the level of privacy protection and the level of 

1 7 aggregation at which responses to queries become accurate. It is possible to add random 

18 noise to data but still allow data that is aggregated across multiple records to be quite 

1 9 accurate, so that useful data mining can be performed. Binning or rounding of data does not 

20 have the same effect. For example, assuming additive noise and additive aggregation, then 

21 randomized data from a number of agents can be aggregated to obtain an aggregate value 

22 (e.g. mean) quite accurately as the number of agents increases for noise perturbation with 

23 zero mean. 

24 Random perturbation of individual data records can provide a cheaper and more 



25 secure alternative to cryptographic techniques, such as secure function evaluation, for 

26 providing information without compromising privacy [Schneier 92]. Randomized data is 

27 secure to computational attacks and the loss or theft of private keys-because we degrade the 

28 data, and make access to any one data item virtually useless. Cryptographic techniques 

29 known in the art provide methods to compute aggregates or other values from encrypted 

30 information without first decrypting this information. However, the general-purpose nature 

31 of these methods makes them unnecessarily cumbersome for complex problems. In 
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1 particular, the communication and computation requirements of these methods when applied 

2 to the problem of aggregation result in an unacceptable overhead on the system. 

3 Finally, all cryptographic techniques are subject to compromise by successful 

4 attacks on the cryptographic scheme or by the loss/theft of private keys. Such problems are 

5 present in all uses of cryptography. In comparison, the method of random perturbation is an 

6 information-theoretic technique, and not subject to the same problems. 10.1 Technical 

7 DetailsWe propose one possible method to select an appropriate amount of noise to add to 

8 data. The noise can be added in two places: 
9 

10 1) At the agent's client machine as data is first provided to the central SDI data 

1 1 warehouse 

12 2) In the SDI data warehouse as queries are executed, with the SDI query execution 

1 3 engine implementing data perturbation for agents. 

14 With numeric continuous data we can select an additive noise term, for example 

15 sampled for example from a Normal distribution. The amount of noise is computed to 

16 provide an appropriate degree of confidence that an adversary with exact information about 

17 its attributes cannot identify the individual. For example, in a Normal distribution, the 

18 standard deviation that is required depends on the number of agents that have provided 

1 9 similar information, and how sparse the information is in its domain. The standard deviation 

20 is "tuned" to provide the appropriate level of privacy. We do not require that all data is 

2 1 perturbed from the same noise distributions. 

22 The amount of noise perturbation to add to a data record can be computed using an 

23 adversary argument, where we assume that an adversary knows the true values of an agent's 

24 data, and may also (strong) know that the agent has submitted data to the database. We 

25 allow an agent to submit or release data with a random noise perturbation, and also can 

26 provide the service in the central SDI data warehouse in combination with price rules, to 

27 control the accuracy of data released to agents in responding to queries. An agent with true 

28 data X computes perturbed data X' with additive noise computed from distribution f, and 

29 submits the distribution f and the perturbed value X' . 
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1 The problem is more difficult with discrete data, but we can instead select a new 

2 discrete element at random from a close neighborhood of the original data point [Section 

3 10.4]. In addition, with multi-attribute data, different, independently chosen noise terms are 

4 used for each field of a record that needs to be perturbed by the addition of noise [Section 

5 10.5]. 10.1.1 Single Attribute and Strong Protection Agent A_l has data X to 

6 submit to a shared database, e.g. the data warehouse in SDI. Alternatively, we can assume 

7 that the agent is about to send the data directly to another agent. Suppose in this example 

8 that X is a continuous real number. 



9 Suppose that an adversary knows that agent A_l will submit data record X, and the 

10 goal of the agent is to submit data X with enough random noise perturbation to provide 

1 1 some protection against identification. The level of noise that the agent needs to add to the 

1 2 record depends on the other data records already present in the database. 

13 The problem is to select an additive noise distribution f that will be used to perturb 



14 the value X, and compute a perturbed value X\ Let eps > 0 denote the agent's desired 

1 5 privacy level, the probability that the value X' was submitted by agent A_l given the data 

16 in the data warehouse and that one of the data records was submitted by an agent with true 

17 value X. 



18 For example, suppose the distribution is f = N(0,1), i.e. a Normal distribution with 

19 mean 0 and standard deviation 1 and the value X = 1. Assume that the random number 

20 generated by f, noise = +0.3. Therefore the perturbed value is X' = 1.3, and this is the 

21 information that the agent submits to the database. Given this, an adversary can compute 

22 that a ex post distribution for the true value, g(X) = N(1.3, 1). 

23 Bayesian analysis can be used to compute an optimal parameterization for the 

24 random distribution, given a desired level of protection and information about the data 

25 already in the database, and the randomized perturbations that were added to that data. 

26 Assume that the database contains a set of data records submitted by agents, each with the 

27 submitted value X' and the noise distribution from which it was generated, f. With this 

28 information agent A_l can select an amount of noise perturbation. 
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1 The goal is to compute Pr(Agent = A_l | true value = X), i.e. the probability that the 

2 agent that submits (perturbed value = X', noise distr. = f), is the agent with true value X. 

3 Using Bayes rule, this conditional probability is computed as Pr(A_l) x Pr(true value = X 

4 | A_l) / Pr(true- value = X). Each of these terms can be computed as follows:Pr(A_l) 

5 This is the a priori probability that a random data record is submitted by agent A_l, and is 

6 equal to 1/N, for N records. Pr(true- value = X | A_l) This is the probability that an agent 

7 that submits (X 5 , f) has true value X, and is computed from X' = X + f(d) as g(X) = X' - 

8 f. Pr(true- value = X) This is computed as the sum of Pr(A i) x Pr(X | A_i) over all data 

9 records, and represents the probability that any one of the data records was represents data 

1 0 with true value X. 

1 1 Agent A_l can now select parameters for distribution f based on this analysis, so that the 

12 probability that it is the agent to submit the new data record is less than eps, its desired 

13 protection level. 

1 4 The rule has the right behavior — the more data present in the system then the more accuracy 

1 5 an agent can use to submit its own information, for the same privacy level epsilon. Notice 

1 6 that the agent is more protected as: 

17 1 . Pr(A_l) decreases, i.e. with more data records in the database 

1 8 2. Pr(true-value= X/ X% f) decreases, i.e. with more noise perturbation f 

19 3. Pr(true value = X) increases, i.e. there are more data points from other agents that 

20 might have true value X. 

21 The parameters for f, for example the mean and standard deviation in the case of 

22 noise generated from a Normal distribution, can be selected to set Pr(X' | true-value = X, f) 

23 = eps for the case that X' = X, i.e. when the random noise is zero. This presents the worst- 

24 case, assuming that distribution f places the most probability on zero noise. 

25 Finally, to compute parameters for f the agent requires aggregate information about 

26 the data in the database that has been submitted by other agents. In particular, the agent 

27 needs Pr(X), the probability that any of the current data records could correspond to data 

28 with true value X. In some cases it is important to receive this information without revealing 

29 true value X. 
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1 In a simple case, for example when a trusted intermediary such as SDI stores 

2 information provided by agents, the agent can simply poll SDI for Pr(X) before selecting a 

3 level of noise to use to perturb its information. Alternatively, when SDI is computing the 

4 level of noise perturbation to add dynamically as information is released in response to 

5 queries then the system of SDI can compute Pr(X) directly. In a more complex case, 

6 consider a problem where an agent is releasing information directly to an adversary, and the 

7 relevant set of data points are data that the adversary agent has already collected from other 

8 agents. In this case the provision of information Pr(X) must be done within a secret-protocol 

9 where the adversary does not learn the value of X in the process. A straightforward way to 

1 0 achieve this is for the "adversary" agent to post aggregated information about the probability 

1 1 that a data record in its population has true value X, for a range of different values of X, and 

1 2 allow an agent to use anonymous look-up in a table. 

1 3 In summary, the following procedure can be used to select an appropriate level of 

1 4 noise to add to a data point: 

15 

16 a. Choose a distribution family (e.g. Normal, Uniform, etc.), and let Par denote the 

1 7 parameters that define a specific distribution. 

18 b. Choose a level of privacy protection, eps where 0 < eps < 1. 

19 c. Request Pr(X) from the database, i.e. the current probability that a perturbed (or 

20 otherwise) data record in the database has true value X. 

21 d. Compute parameters for the distribution to set eps = max_{X'} Pr(A_l | X% X, f) 

22 where X' is the value generated from f. 

23 I OA. 2 Examples: Strong Protection^) Uniform Additive Noise Perturbation 

24 Distributions 

25 Assume that every agent submits information from a uniform noise distribution, 

26 centered around its true value. The decision variable in choosing a level of data perturbation 

27 when submitting new information is the range of the uniform distribution. Suppose agent 

28 1 submits data point g_l = [4, 6], to denote that its perturbed value is X', and the value was 

29 computed with additive uniform noise U(-l, 1). Agent 2 submits data point gjl = [4,6], 

30 agent 3 submits g^_3 = [3,5], agent 4 submits g^4 = [3,6], and agent 5 submits g_5 = [2,4]. 
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1 Suppose that an adversary knows that user John has true value X = 2.5, and that an 

2 agent for John has submitted a data record. In this case the adversary can be sure that 

3 agent 5 represents John, because Pr(A_5 | x = 2.5) = Pr(A_5) Pr(x = 2*5 | [2,4]) / Pr(x = 

4 2.5) = 0.2 * (1/2) / (0.2 * 1/2) = 1. In comparison, for an adversary that knows that Mary 

5 has true value X = 5, then Pr(A_l | x = 5) = 0.2 * (1/2)/ (0.2 * Vi + 0.2 * l A + 0.2 * 1/3) = 

6 0.375, and similarly for agent A_2. The probability that agent A_4 is Mary is 0.25. Now, a 

7 new user, Bill, with agent A_6 wants to provide information about its data X = 5, and wants 

8 to be sure that an adversary cannot determine its identity with probability greater than eps = 

9 0.1. Let d_6 denote the uncertainty selected by agent A_6, and compute an optimal d_6 as 

10 so that Pr(A_6 | x = 5), i.e. the probability that agent AJ has true value x = 5. This is 

1 1 computed as Pr(A_6 | x = 5) = 1/6 * (l/d_6) / (1/6 * 1/2 + 1/6 * 1/2 + 1/6 * 1/3 + 1/6 * 

12 l/d_6) = 1/ (4/3 * d_6 + 1), and is less that eps for d_6 > 6.75. Therefore agent A_6 should 

1 3 generate a perturbed value with additive noise computed with uncertainty 6.75, and can then 

1 4 be sure that its identity is protected. 

1 5 (b) Normal Additive Noise Perturbation Distributions 

16 Now, assume a normal noise distribution, and let sd_i denote the standard deviation 

17 selected by agent A_i for its noise distribution. The analysis is slightly more complicated, 

1 8 because the Pr(X | A_i) is now a function of the position of X within the distribution, not 

1 9 just whether it is in range as with uniform distributions. 

20 In this case an agent chooses the standard deviation for its 

21 Normal distribution by assuming that the randomized value X' = X, i.e. 

22 that the random noise distribution generates zero noise. This is the 

23 worst-case, maximizing Pr(A_6 | X) because Pr(x | A_6) is maximized 

24 and the other terms in the Bayesian expansion are invariant. 10.2 

25 Rounding and Binning of Data as an Alternative to Random Perturbation 

26 An agent can gain the same benefits of random perturbation in a more static scheme, 

27 where data is rounded or "binned" into intervals, i.e. agents report only approximate 

28 information to other agents. This works for the same reason, mapping each released data 

29 value to more than one true value. 
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1 The problem is to select buckets for information (i.e. intervals on data) so that there 

2 are enough data records in each bucket to provide the required level of protection against 

3 adversaries aimed at uncovering the identity of agents from the information that they report. 

4 The previous analysis extends to this case, but can be simplified. If an adversary 



5 knows that an agent has true value X, then its first task is to find the prototype value (or 

6 bucket) corresponding to the true value. The probability that an agent has the true value is 

7 the same for all agents that have posted values with the prototype value, and equal to 1/N_p 

8 where N_p is the number of agents with the prototype value. 



9 The decision for agents about submitting information is also simplified: if (1/N_p) < 

10 eps for the prototype that the agent's value maps into and a desired level of protection eps 

1 1 against an adversary, then select the prototype value in the interval and report information, 

1 2 otherwise report no information. 

1 3 In order to support agents with different levels of privacy-sensitivity the system of 



14 SDI can allow agents to select a degree of accuracy when submitting information— 

1 5 providing nested buckets. For example, we can allow an agent to specify that its true value 

16 can lie in one of a number of buckets, or equivalently in a single super bucket. The number 

17 of agents with true values in a particular interval is then computed as the sum of the agents 

1 8 that report values in an interval, and a weighted sum of the agents that state that their values 

19 are in one of a number of intervals, according to the total size of interval specified and the 

20 size of the current interval. 

21 A useful variation on this method is to control the amount and type of information 

22 released, but only releases accurate information. The degree to which a user can be 

23 identified if it submits information X to a vendor depends on the other agents that have 

24 submitted similar information. For example, if X is very detailed (e.g. my street address), 

25 then it is probable that no other agent has submitted the same information, and I will self- 

26 identify myself with that information. On the other hand, if X is quite abstract, for example 

27 my ZIP code, then it is quite possible that a number of other agents with the same ZIP code 

28 have already submitted and released the same information. The method discussed above can 

29 be simply adapted to allow a client-side SDI proxy agent, or the SDI data warehouse, to 
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1 decide how much information to release in order to protect a user's identity, based on 

2 information about the data already in a database. For example, instead of my street address 

3 my agent could choose to submit my ZIP code. 10.2.1 Example: Binning Techniques 

4 Suppose that an agent must decide whether or not to reveal its salary on the basis of 

5 information that has been revealed by other agents. The database supports intervals: 

6 <$20000, $20000-40000, $40000-100000, >$100000. The number of agents with each 

7 prototype value is currently 10, 12, 16, 5. 

8 Now, suppose a new agent represents a user with salary $200,000 and privacy 

9 threshold eps = 0.1. The prototype values prevent an agent from stating the high value (> 

10 $100000) because there are less than 10 agents in the bucket. However, if the provider also 

1 1 allows agents an option, for example "my value is in bucket 40,000 - 100,000 or > 100000", 

12 then the agent can state that information safely. In this case, the probability that the agent is 

1 3 an agent with salary $200,000 is computed as its weight in the top valued bucket, divided by 

1 4 the total weight, i.e. 0.5 / 5.5 < 0.1 

15 This technique of binning information, or rounding information, is simpler to 

16 implement that random noise perturbation because probability distributions must neither be 

1 7 sent to the provider or maintained by the provider. However, the information is represented 

1 8 with a static accuracy which is hard to improve as the amount of data submitted increases. 

1 9 This occurs automatically in the standard version. 

20 10.3 Adding Random Perturbations to Discrete Data Adding noise to discrete 

21 values is a little more complicated, because the new value must remain feasible. For 

22 example, in randomizing the name of a CD the new name must be the name of another CD, 

23 not some "made up" name. We need to add noise to make data elements "close" to the 

24 accurate values. With discrete data, such as the name of an artist, "close" must be defined 

25 within the correct metric. The appropriate metric is such that a "close" value shares many of 

26 the same characteristics. For example, it is not appropriate to assign a close value on the 

27 basis of a shared last letter in the first name, but it is appropriate to assign a close value on 

28 the basis of an artist from the same genre of music — from a "semantic cluster". 

29 Agents use discrete probability distributions to randomize data points. 
30 



Page 108 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 For example, suppose that an agent wishes to reveal the name of the artist that 

2 recorded the last compact disk that a consumer purchased. An artist's name is best viewed 

3 as a point in "artist space", and therefore as a discrete value. It does not make sense to 

4 change a random set of letters, because the new "name" will not be the name of a valid 

5 artist. Instead, the concept of noise is to randomly choose a new name close to the current 

6 name, i.e. choose a new feasible location in artist space. One reasonable solution for the 

7 names of recording artists is to define a neighborhood of artists that are close to the original 

8 artist, and select a new artist from within the neighborhood with equal probability. The 

9 metric that defines how close artists must be computed using a system that is common 

10 knowledge to the provider of information and the consumer agent, because the provider of 

11 information must be able to compute the distribution over true artist name, given a 

12 randomized name and probability distribution. We can define the probability distribution 

13 with a threshold in the distance d(namel, name2) metric, and the tag 'uniform' to indicate 

1 4 that each artist within the neighborhood was assigned an equal probability weight. A metric 

15 for artist names can be computed using clustering techniques, where artists are clustered 

1 6 according to the buying habits of consumers that purchase them. 

17 Semantic clustering that enables useful randomization of discrete field can be 

18 automated when goods are frequent purchase, high volume goods-where individuals 

1 9 purchase goods on multiple occasions, and more than one of the family of goods on a single 

20 occasion. However, it is possible that high price, low volume goods, should be randomized 

21 on the basis of expert analysis (for example new cars, computers...) - where an expert can 

22 extract key features of a purchase, and represent the purchase generically using either a 

23 single prototype good, or one of a set of approximately equivalent goods. 

24 Another approach to discrete data is to apply the "approximation" technique and 

25 allow agents to specify a genre of music for example, or a prototype value from a semantic 

26 cluster. This is equivalent to stating a range of salaries. For example, the system of SDI can 

27 provide clusters for music, and allow a submitting agent to state the cluster of the artist for 

28 the CD that it just purchased instead of the name of the artist. 1 0.4 Noise with Bundles of 

29 Information. In general agents will release more than one piece of information to other 

30 agents, in a bundle of information. Intuitively, when agents submit a bundle of information 
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1 they open up more possibilities for adversaries to determine their identity, because a bundle 

2 of independent information reveals more about a user than a single piece of information. 

3 With a couple of assumptions, we can apply the same technique of random 

4 perturbations. It is again required that an agent either has information about the distribution 

5 over data points present in the data base, or in the population at large. For example, consider 

6 an agent that has already submitted approximate information to a provider about its home 

7 ZIP code, annual salary, and model/year of car. The agent needs to decide on an appropriate 

8 level of randomization for new information about its profession. 

9 The required level of randomization depends on the number of other agents in the 

10 system that could have the same salary, ZIP code, car, and profession, given the randomized 

1 1 information that they have submitted. 

12 The problem of computing the current probability that another agent in the database 

1 3 might have true data B is more difficult because not all agents will submit the same bundles 

14 of information, i.e. some agents might provide more information than others. 

15 An agent will add independent noise to each element of its data bundle so that 

16 bundles of information released under two different pseudonyms cannot allow the 

1 7 pseudonyms to be linked. Also, if it is important that my data not be revealing an agent 

1 8 might add a random term that is shared across all my data values so that for example all my 

19 salaries or all my CD purchases are aggregated, an adversary does not get an accurate 

20 picture of my preferences. 

21 Assume that each element of a bundle of information is randomized independently. 



22 For example, if agent 1 has provides information about values X_a and X_b then it 

23 computes perturbed X'_a and X'b from adding independent random values from 

24 distribution f. This allows the probability Pr(X_a, X_b | Agent 1) to be computed for 

25 values X_a, X_b as the product of the two marginal random distributions. 

26 We compute the probability of (X_a, X_b) for an agent that has only provided one 

27 value, e.g. X'_a, we can assume that the value of the other element is independent of the 
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1 value of X'_a and multiply by the a priori probability of X_b, over all agents that have 

2 provided information about their values for X'b and therefore X_b. 

3 The appropriate noise distribution f can be computed as before by setting the worst- 

4 case probability that an adversary can identify the agent to its desired protection level. 

5 10.4.1 Example Here is a simple example for a problem where the data is two- 

6 dimensional, with two features. Consider feature a, 'year of birth', and feature b 'car 

7 model 5 . Suppose that agent 1 has submitted car model [VW Beatle, 0.1] to indicate that it 

8 selected a random car model from within a proximity threshold of 0.1 ; agent 2 has submitted 

9 car mode [Mercury Mistique, 0.05] and year of birth [1962 1968] to indicate that the year is 

10 somewhere in that range; and agent 3 that has submitted year of birth [1965 1975]. Now, 

11 agent Z, with true information (VW Beatle, 1972), computes a randomized data entry as 

12 agent 4, based on a privacy threshold of epsilon = 0.4 for an adversary with correct 

1 3 information about agent Z. Assuming that the Mystique is not within a threshold of 0.05 of 

14 the Beatle, agent 2 has placed on probability on its true car model being the Beetle. Also, 

1 5 assume that the proximity threshold of 0. 1 on the Beetle implies that agent 1 has a Beetle car 

16 with Pr(Beetle | Agent_l) = 0.2. Finally, the probability that an agent over the entire 

17 population has a Beetle is 0.1. Furthermore, the probability that an agent in the population 

18 was born in 1972 is 1/(7 + 11) = 1/18. Therefore, agent Z must place probability P_z on 

19 (Beetle, 1972) such that P_z / (1/18 * 0.2) < 0.4, i.e. P_z < 1/225. To support this, the agent 

20 can randomize each data point with probability (1/225)^(1/2) = 1/15, i.e. randomize 1972 

21 with a uniform distribution of years [-7, +7], and select a threshold on cars greater than 0.1 

22 to give a smaller probability than the 0.2 achieved by agent 1 with a proximity of 0. 1 . 

23 10.5 Bootstrapping A Data Set Bootstrapping is an important technique for the 

24 system of strong data perturbation, in which we assume that an adversary knows that a 

25 particular agent has submitted one of the data values in a data base, and knows the true 

26 values of that agent. 

27 A problem occurs at the start of a system, when there is little data in the database. In 

28 this case it is often possible that an agent cannot reveal any information because no other 

29 agent's data can possibly represent the agent's true value. For example, if an agent with age 

30 18 is deciding a level of randomization, and there is currently a handful of ages in the 
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1 database: {[24, 28], [34, 38], [42, 46], [42, 44] } then the agent cannot submit a randomized 

2 value and distribution, because the range of implied true ages will always include ' 18', and 

3 be the only approximate information to include ' 1 8'. The agent cannot provide its age to the 

4 database. 

5 We suggest a simple technique to bootstrap the system. The method works in the 

6 case that there are a number of different reasons to decline to provide information to another 

7 agent in response to its query. An agent provides randomized data and a privacy level eps to 

8 the database, e.g. ([16, 20], 0,2), where [16, 20] is the range of possibilities for an agent's 

9 true age, and 0.2 is the desired privacy level. 

10 The SDI database can be configured to only release information when it can verify 

1 1 that it is safe to. Similarly, an automatic method on the client machine of a user can be 

12 configured to only release information when it is safe to do so. The problem that is solved 

1 3 here is that sometimes there is no reasonable amount of noise that can be added to a data 

1 4 value to protect a user's identity. 

15 Another technique to solve the bootstrapping problem could hold all data until it is 

16 safe to release information for the majority of data points, given the state privacy 

1 7 requirements of users etc. The measure of "safe" is defined by the information in the 

18 database and the privacy level specified by each submitting agent. A more advanced 

19 technique could introduce data periodically as new parts of the data space become 

20 populated, so that there is never an "active" under-populated part of the space. 10.6 

21 Verifying Noise Levels with Playback We can use a cryptographic technique to 

22 verify the distribution of noise that is added to data - and also to enable replay. "Playback 

23 ability" - the ability to reconstruct the original record from a noisy version of that record is 

24 important for a number of purposes. An individual may want to obtain proof of a transaction 

25 for legal purposes and law enforcement agencies with appropriate warrants might want to 

26 examine original records. 

27 The client-side SDI proxy, or the central data warehouse, must keep a record of the 

28 non-randomized data X, and a method to verify that the randomized data was computed 

29 from data X. To generate a random noise value from a distribution f the agent can use a one- 
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1 way "trapdoor" function trap on the object X to generate a seed for a pseudo-random 

2 number generator. The pseudo-random number generator then generates a sequence of 

3 random numbers that are used to create the random perturbation from a well-defined 

4 algorithm. Suitable trapdoor functions known in the art include the RSA encryption- 

5 decryption function. It is not necessary for all agents to use the same private/public keys. 

6 With this method it is possible to verify randomization and audit data release with 

7 stated levels of randomization. SDI can request that an agent provides: X, f, and the 

8 trapdoor function trap, and can use this information to validate X'. Note, that because trap 

9 is a one-way function the agent cannot fabricate different values of X that would give X' 

1 0 given random function f. SE)I can "playback" the noise perturbation and produce the noisy 

1 1 record from the original record. 

12 11. Architectural VariationsThere may be commercial contexts in which an SDI service can 

13 be established where there is already in place a pre-existing trust relationship between 

14 multiple vendors and a third party. Such third parties are inherently motivated to provide 

1 5 services to enhance advertising and e-commerce for their existing and potential customers. 

16 These third parties may include, for example, web hosts or e-commerce service providers 

17 (ESPS) which often have hundreds or thousands of sites which they host, Web portals, 

1 8 information and commerce service manufacturers, advertising and affiliate network services 

19 and data analysis and business intelligence tool providers (which includes the business to 

20 business application). 

21 A third party may wish to implement an SDI which operates separately and 

22 independently from the central SDI service. Alternatively, some of these third parties may 

23 install an SDI server on their customer information server. The server may be integrated into 

24 an existing advertising service which they operate and maintain, in which case the vendor 

25 receives an appropriate fee for data which is exchanged between his/her existing customers, 

26 and a reduced fee (which may be split with the central SDI service) for data which is 

27 exchanged by/between a member of his/her SDI service and vendors who are members of 

28 the central SDI service but not of his/her local SDI service. Unless or until the client-level 

29 proxy server becomes a standard or a large critical mass of end-users adopts the service, one 
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1 of several compelling business models could be used by a vendor to encourage the visitors 

2 to his/her site to adopt the user-centric SDI service based upon the monetary incentives the 

3 user may receive for subscribing. Typically the resulting revenues are split between the 

4 user, the user-centric SDI service and the vendor (in exchange for promoting the service to 

5 their site visitors a share of resulting revenues generated may be necessary). 11.1 An 

6 open SDI systemAn ISP level proxy server can contain the user profile generation module, 

7 profile processing module, user profile interest summary generation module and target 

8 object generation module which operate in distributed manner. This enables an ISP to 

9 independently implement the core functionality of the system without the cooperation of 

10 information vendors (Web sites) or their operators (Web hosts) who opt-out of SDI. The 

11 modules in third-party SDI servers can share information with the modules in network 

12 vendor servers. This flexible architecture enables the user-centric SDI service to be 

13 implemented (by ISPs or completed independently) and when available inter-operating with 

1 4 the complete data sets available from the information vendors. 

1 5 SDI can allow third parties to operate their own secure advertising and/or electronic 

16 commerce-based product syndication affiliate network (for all customers). In accordance 

1 7 with the preferred implementation of SDI, these "advertisements" are represented at the 

18 item-level and may be distributed across the (proprietary or main) SDI system network 

1 9 presented in the form of ads, affiliate or portal links to purchasables or sites (which may 

20 contain target objects as purchasables) and/or simply (transparently integrated) selections on 

21 an e-commerce product catalogue. The users that are also subscribed to SDI, can be given 

22 highly personalized information for each site or for the network 'of sites (which could 

23 involve an interface which provides site to site links as a "virtual mall"), and a menu 

24 interface to these sites which includes the 2 or 3 dimensional personalized menu features 

25 and personalized search facilities as disclosed in the parent description (a "personalized 

26 portal"). Traditionally the term "portal" was exclusively reserved for major search 

27 engine/directories such as Yahoo, Lycos, Alta Vista etc. 

28 However, at the present time, there is a trend in which may Web-sites are becoming 

29 what are called "portals" or "community portals" serving a particular e-commerce or content 

30 niche or theme. Often highly robust content and/or extensive (usually web-wide) links to 
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1 relevant information resources and web-sites are accessible from such portals in order to 

2 create a "one-stop shop" service to visitors or community members. Additionally, in order 

3 to leverage low-cost access to more robust content, a transaction-based or "click-through- 

4 based" or transaction based model is also emerging. This model fits particularly well 

5 within the current framework of SDL In particular user profiles provided by SDI-enabled 

6 site visitors provides the platform for automatic generation of not only personalized ad 

7 banners but also content and portal links, it is reasonable for these community portals as part 

8 of their business relationships with the sites and advertisers to which they provide access to 

9 share revenues paid by these sites and advertisers with the end-users (which significantly 

10 increase the click through by virtue of personalization of information at potentially all 

1 1 varieties at the site resulting from revelation of the profiles from those visitors). As 

1 2 discussed on the parent case, these pages are pre-cached in advance onto the local server and 

1 3 client. 

14 Within today's internet infrastructure as pages are typically dynamically generated, 

1 5 the files are first pre-cached then dynamically generated local to where they are anticipated 

16 to be used. Additionally, high-end community portals typically offer content from external 

17 sources which is sometimes syndicated from content providers which are paid by click 

18 through or transaction. In this model the user or the site may absorb these fees paid to the 

19 content provider (because the content when personalized may appeal to almost any type of 

20 user, most typically the site would subtract a certain percentage of the click through or 

21 transaction fees paid to the user). As is discussed in the parent patent application, content 

22 (e.g. links to specific syndicated items) may also be created based upon the collective user 

23 statistics those links are presented to the users for which user profile information is not 

24 available (where these page links could be generated on-the-fly based upon the user profile). 

25 The user-centric SDI service also provides valuable information in not only 

26 generating the data model for personalizing site content but in general identifying what 

27 type of content to syndicate to the site. In a variation, it may even be possible to employ 

28 the statistical techniques used by SDI to dynamically select and generate personalized 

29 pages in a much more ad-hoc and unrestricted fashion. Using the syndication model, a 

30 vendor may agree (based upon their associated privacy policies), to enable other sites to 
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1 not only syndicate their content which may be contextually relevant to that site, but also 

2 actual portions of the site or portal may be linked from other pages on the site which are 

3 contextually relevant. These pages may be linked from the portal directory or as 

4 hyperlinks which are links to contextually very similar pages (or the vendor may allow 

5 these pages to be modified as needed by the site. As a general site development 

6 technique also the user's click through patterns are useful in updating or suggesting 

7 updates to the model to improve the value of the site. Of course, some vendor's privacy 

8 policies will restrict the ability of some sites which can utilize the content. 

9 1 1 .2 A Closed SDI System for a Syndicated NetworkThe Web host (or more 

10 generally a vendor, a provider and/or operator of server functionality to a variety of 

1 1 information vendors), may also be interested in operating his/her own closed version of 

12 SDI. The main SDI server for the closed system can be located on the network vendor 

13 servers, or it may reside upon the information vendors servers (as it is operated by that 

14 local Web host). For example, an affiliate ad network (including a web host acting in 

15 such capacity) could upon installing SDI onto their network enable and enforce the 

16 wishes and desires of advertisers (and particularly) sites which are advertised upon with 

17 regards to what types of sites and advertisers (respectively) they allow or disallow for 

18 purposes of standard or affiliate advertising, in accordance with the methods herein 

19 disclosed. The general implementation for determining which this general application for 

20 using collective user feedback to determine relevant site links was described in the parent 

21 issued patent). In this case, end users who are subscribed to SDI would receive 

22 personalized affiliate links (including product level recommendations for on-site 

23 purchases) which have been pooled and profiled at the main SDI server from all SDI 

24 vendors (in distributed fashion) and matched with the user. Alternatively, such approach 

25 may also be used for ISPs who wish to operate their own closed user-centric SDI. 

26 11.3 Interoperability Between Local SDI Services With interoperable (local) SDI 

27 services, we can also facilitate the secure enforcement of data sharing policies and transfer 

28 of transaction fees between these local SDI services E.g., by/between aggregations of ad 

29 networks, syndication networks e-commerce site, portals and Web hosts operating virtual 

30 portals and advertising/syndication networks with personalization as its primary capability. 
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1 In each of these primary example domains, the server operator is financially 

2 motivated to sell the SDI services to his/her sites because the transaction based model is 

3 used, and the server operator receives the commission on each transaction (or click through) 

4 occurring within his/her network of sites. However, if the server operator also integrates 

5 his/her local SDI service into the main SDI service (to share user lists and impressions 

6 and/or space to advertise to these target users), s/he can receive a commission (in 

7 conjunction with each vendor transacted with) for each advertisement placement or 

8 syndicated transaction to or from his/her network. 

9 We can also allow the local server operator to split the transaction fee (normally 

10 received from the main SDI service), thus "referral fee" for both the referred customer and 

1 1 the referral of customers (through the placement of outside ads or products on one of his/her 

1 2 sites) or other means of targeting his/her site's existing customers. 

13 Reduced overhead resulting from economies of scale which may likely result in 

14 incentives to the local operation, e.g., free installation and operation of his/her local main 

1 5 SDI server by the main SDI service, i.e., as the operational overhead would be cost justified 

1 6 by the shared transaction fees of customer referrals and advertising space coming back to the 

1 7 main SDI service. 

18 This architecture also may be useful and is ideally suited for cross vendor product 

1 9 advertising as through an ad network or product syndication network using affiliate links. In 

20 addition to the user profile generation module, a target object profile generation module 

21 should also reside across the network vendor servers such that it is possible to generate 

22 target object profiles for target objects on network vendor servers. Alternatively, user 

23 profiles and target object profiles are downloaded to the client level proxy which performs 

24 collaborative filtering tasks as the user browses from site to site. 

25 In both of these cases, the main SDI server can receive user profile data generated 

26 from the user profile generation module located on the ISP-level proxy, and target object 

27 profiles generated from the target-object profile generation modules located on the various 
2 8 multiple information vendor servers. 
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l 12. Ancillary Systems 

2 

3 12.1 Support for Pseudonymous electronic mail The ISP-level proxy server is 

4 positioned just behind the firewall of the user's local dial-up network (ISP or Intranet). The 

5 proxy provides protection for users operating under pseudonyms from point-to-point attacks 

6 and HTTP header-tracking by stripping HTTP header-information and forwarding HTTP 

7 packets on to their destination with no information other than their source at the ISP-level 

8 proxy server. The ISP-level proxy also supports pseudonymous e-mail, between users, and 

9 between users and vendors. 

10 Figure 2 shows a couple of users connected to clients, that are in turn connected to 

1 1 the Internet through a local intranet, such as the network of an Internet Service Provider 

12 (ISP). 

13 The proxy '"washes" outgoing messages of any information that would compromise 

14 a user's pseudonymity, for example the "referral" field that contains the previous URL of a 

15 user in a HTTP message. HTTP messages also leak other information, for example browser 

1 6 software on a user's client machine, the operating system and a user's IP address. 

1 7 A user can receive electronic mail through the PID and associated IP address of the 

1 8 ISP-level proxy server. 

19 The preferred implementation of this system allows the user to periodically check 

20 for new mail. The client-level proxy gains access to the mail box that is associated with a 

21 pseudonym by providing a correct response (signature) to an ISP-generated challenge. 

22 Notice that with this solution, the ISP-level proxy has no way to connect the pseudonyms of 

23 a user, so long as the user's client is not identified in its messages to the ISP-level proxy 

24 server other than by the PID that the proxy makes a request for. 

25 We can extend this mechanism using a technique taught in the Lucent Personalized 

26 Web Assistant (LPWA). The LPWA [BGGMM 97; BGGMM98] provides for a sequential 

27 access mechanism to the mailboxes that belong to a user through a one-way function that 

28 takes the user's SDI log-in name and password, and an integer from 1 to N, and computes 

29 the mailbox location. The mail server does not need to maintain a list of pseudonyms for 
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1 each user, because the user is able to efficiently access all of its mailboxes sequentially as a 

2 function of other information. 

3 Another variation, that relies on the user placing trust in the ISP-level proxy server, 

4 provides the ISP-level proxy with the e-mail address for each pseudonym. This push method 

5 is more efficient, because the ISP proxy and the client proxy communicate only when new 

6 messages arrive, but provides the ISP proxy with information to compute all the 

7 pseudonyms for a single user — probably undesirable. 12.2 Support for Pseudonymous 

8 Physical Mail 12.2.1 Vendor to User A vendor must hold a "physical mail 

9 certificate" to be able to send mail (packages, letters) to a user under a pseudonym. The 

1 0 certificate is similar to the "electronic mail certificate", in that it is signed by the private key 

1 1 of the user's pseudonym, and indicates that the vendor with public key P*V can send mail to 

1 2 the user (under the pseudonym). 

1 3 Each user has a trusted physical address authority, just as it has a trusted electronic 

14 mail authority (the second-level proxy server), that maintains the physical mailing address 

1 5 for each pseudonym. When a vendor has a letter X to mail to user with public key PKP, the 

16 vendor generates a unique ID for the package, IDX, and sends the ID code and the physical 

1 7 mail certificate to the trusted physical address authority of the user. 

18 The physical address authority receives the certificate, S( (PKP, PK*V, 

19 SEND_MAIL), SKP), that indicates that the vendor is authorized to send mail to the 

20 pseudonym, and the packages identify code, signed by the vendor to certify that the vendor 

2 1 holds the secret key that matches the public key in the physical mail certificate. 

22 The vendor then passes the letter X and the signed ID code to a trusted mailer, that 

23 supports pseudonymous mailing, and has been certified by the central SDI server as such. 

24 The trusted mailer then provides the signed ID code to the physical address authority, signed 

25 with the private key of the trusted mailer. The physical address authority verifies that the 

26 trusted mailer is a valid service, and releases the real address of the user to the mailer. The 

27 mailer now has the letter X that the vendor wants to send to the user with pseudonym P, and 

28 the physical mailing address of the user - and the package can be mailed. At not time did the 

29 vendor determine the true mailing address of the user, unless it works in collusion with the 
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1 trusted mailer, but the trusted mailer is certified by SDI, and also audited by the chosen 

2 physical address authority of the user. The address authority will only release addresses to 

3 reputable pseudonymous physical mail agents. 

4 We can operate physical mailing lists in the same way, and gain additional security 

5 by never releasing the pseudonyms or the mailing addresses to the vendor that has requested 

6 the targeted solicitations. We can use a technique that is similar to the technique that we 

7 used for virtual mailing lists. The vendor describes its solicitation to the central Secure Data 

8 Interchange, which leverages as much data as possible (without violating the privacy 

9 policies of any of the users or vendors that are represented within the data). The central SDI 

10 server generates a list of suitable pseudonyms, and then provides a series of unique codes to 

1 1 the vendor, that the vendor can supply to its chosen pseudonymous mailer with the material 

12 that is to be mailed. The central SDI server also provides the appropriate address authorities 

13 with authorization to release the physical mail addresses to the mailer when presented with 

14 the IDs. Notice that at no stage did the vendor have the pseudonyms or the mailing 

1 5 addresses. The parties all have only as much information as is necessary - the vendor needs 

16 someway to identify its packages to the pseudonymous mailer. The mailer needs an 

17 identifier to present to the address authority, and receives the addresses. The address 

1 8 authority just needs to know what addresses to release and to which third party. 

19 12.2.2 User to Vendor mail The Secure Data Interchange system also provides a 

20 mechanism for users to send physical mail to vendors that are registered with SDI with 

21 pseudonymous return addresses. In particular, when a user sends mail to a vendor, the first- 

22 level proxy server provides a tool that: (1) Computes/Looks-up the appropriate pseudonym 

23 for the user with this vendor. (2) Generates a unique ID, and submits a signed message to 

24 the central SDI-server, where the message relates the pseudonym, the vendor, and the ID. 

25 (3) Provides the unique ID to the user. 

26 The user writes the unique ID on the envelope, and mails it to the vendor. Should the 

27 vendor wish to reply to the user, then the vendor can take the envelope to a pseudonymous 

28 mailer, and request that the envelope be mailed appropriately. The pseudonymous mailer 

29 verifies the identity of the vendor, and then submits the ID, together with the vendor's 
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1 signature, and its own signature, to the physical address authority that is maintained by SDL 

2 SDI releases the address to the mailer that can then return the mail. 

3 12.3 Pseudonymous Payment Mechanisms The Secure Data Interchange 

4 architecture must be able to support all the standard electronic commerce functions that we 

5 take for granted, but while maintaining pseudonymity for users and following privacy 

6 policies. There are various different solutions to this problem. 

7 12.3.1 Anonymous Credit Card Payment [LMP 94] The second-level proxy server 

8 can maintain information on the user's credit card information, and perform the following 

9 transaction. Whenever a user makes a purchase from a vendor, the user provides the vendor 

10 with authorization to bill $x to his/her credit card account, but anonymously - through the 

1 1 Secure Data Interchange as a middleman. The user generates a unique number, Y, and signs 

12 a "right to payment" message, M=( $x, PKP, PKV, Y), that gives the vendor the right to 

13 make a claim for payment of $x from the Secure Data Interchange. The first-level proxy 

14 server registers the unique number Y with the second-level proxy server to ensure that the 

1 5 vendor does not spend the money twice, and provides the proxy server with authorization to 

1 6 charge $x to his/her credit card when the request for payment is presented. 

1 7 When the vendor submits its "right to payment" and proof of identity to the second- 

1 8 level proxy server the proxy server first runs the charge through the user's credit card, and if 

1 9 that clears, runs the charge from the vendor through the account of SDI (which could also be 

20 a credit card, or could be operated as electronic cash or some other mechanism for 

21 payment). 

22 This "anonymous credit card" payment method has the following properties: 

23 1. The user's credit card pays $x, but does not know who receives the money 

24 except that it is going to the Secure Data Interchange. 

25 2. The vendor receives payment for $x, but does not know the user's credit card 

26 information, or the user's identity. 
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1 The Secure Data Interchange incurs no financial risk because it receives payment from the 

2 user before making payment to the vendor, although there could still be problems if the user 

3 complains about the quality of the good for example. 

4 This protocol is simpler than full cryptographic anonymous credit card mechanisms because 

5 the SDI acts as a rusted third party to both the user and the vendor. 12.3.2 Electronic Cash 

6 [Chaum 85; Chaum 92]Electronic cash is anonymous, just like physical cash. The user 

7 purchases electronic cash from an electronic bank, presenting blinded notes, so that the bank 

8 has no record of the note numbers that it issues to the user. For example, the user generates a 

9 new note number, X, and has the bank sign a blinded copy with its $10 signature, S(B(X), 

10 SKBANKS10). Then the user, or the first-level proxy for the user, removes the blinding 

1 1 factor, and can use the electronic cash as tender. Whenever the note changes hands the 

1 2 recipient needs to check with the bank that it has not yet been spent, because notes are easily 

1 3 copied, but not forged. 

14 Electronic cash has the same useful properties as anonymous credit cards, although it is 

1 5 perhaps a little more exotic. In particular, notice that the bank does not know to whom, or 

16 for what, payment has been made, and the vendor does not know which user made the 

17 payment - it just receives the payment. We have minimized the amount of information 

1 8 exchange that takes place between the various parties in the system. 

19 12.4 Client-Side SDI ProxyThe client-level SDI proxy, implemented as a client program 

20 running on the user's client machine, manages all data transfer between the client machine 

21 (and the user), and other vendors and the central SDI data warehouse. A key function of the 

22 client-level proxy is to implement profile management for a user, to control the ability of 

23 agents to track a user as he/she interacts with multiple vendors. The client-level proxy also 

24 controls release of profile information: the addition of demographic and other personal 

25 information to profiles, and the control of random perturbation to fields to prevent linking 

26 across user profiles. 

27 The client-level proxy maintains profile information for a user's collection of pseudonyms, 

28 and allows the user to view and challenge profile information. The proxy also provides a 

29 rule-based interface to allow a user to select appropriate privacy/personalization policies. 
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1 The primary mechanism that protects the identity of a user across multiple vendors and 

2 service providers is the ability to interact pseudonymously with vendors. The user can 

3 choose a unique pseudonym for each third party with which he/she interacts, and be 

4 absolutely certain that he/she is the only party that knows his/her true identity. There is no 

5 way that a vendor can know anything about the transactions that a user has had with other 

6 vendors under alternate pseudonyms unless the user chooses to disclose the equivalence of 

7 pseudonyms, or use the same pseudonym across multiple vendors. 

8 13. MODES OF USE OF SDI 

9 It is useful to distinguish three key modes of use of the Secure Data Interchange system: 
10 

11 o Static data-mining. Query execution with no dynamic requests for new 

12 information from information providing agents. 

13 o Interactive data-mining. Query execution that includes dynamic attempts to 

14 request additional information from information providing agents. 

15 o Client-side data-mining. Query execution that is performed on the local client 

16 machine of an agent, based on data stored exclusively on that machine. 
17 

18 All three modes may use distributed information, i.e. it is possible that the information is 

19 stored in the central SDI database, or on distributed client-side information servers, or in 

20 third-party servers. The first 'static' case and the second 'interactive' case are 

21 distinguished from the third 'client-side' case in that the query execution is performed 

22 centrally in the SDI data warehouse in the former, and on an agent's client machine itself 

23 in the latter. Client-side data mining has particular application to privacy-protected 

24 customization of information and services in on-line business-to-consumer applications. 

25 l. Static Data Miningln static data mining queries are executed on the information that is 

26 currently present in the SDI data warehouse, and there is no opportunity to contact agents 

27 and request more information. Applications of this type of static data mining include all 

28 types of "standard" database queries, where it is assumed that the data set is static. 

29 Queries may be open-ended, i.e. "find me all data records of this type, and perform the 

30 following operations.." or closed, i.e. "perform the following query on the data record for 



Page 123 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 agent with pseudonym PI". Queries may also have side-effects, i.e. "find all records that 

2 satisfy this constraint, and then take action A". 

3 2. Interactive Data Miningln interactive data mining multiple agents may be contacted 

4 by the system of Secure Data Interchange in the process of executing a query, to request 

5 new information from agents, or push information to agents. A central application of this 

6 mode of data mining is matchmaking, which is a process where information flows 

7 between agents if and only if both the profiles of both agents are mutually compatible. In 

8 interactive data mining the querying agent does not need to know the identities of agents 

9 that are contacted by SDI, this is all transparent, and hidden from the querying agent. 

10 3. Client-Side Data Miningln client-side data mining the querying agent executes a 

1 1 query with the information associated with a specific agent, and the information remains 

12 located on that agent's client machine throughout the query. The query is executed by 

13 providing the query method to the client machine, processing the method with local 

14 information, and then returning a response or taking an appropriate action (e.g. displaying 

15 product X for price Y). A central application of client-side data mining is to privacy- 

16 protected customization, where a vendor wishes to customize its products and services 

17 for a particular agent that is registered with SDI, and take advantage of personal 

18 ■ information relating to that agent that is not generally available. As another application, 

19 we describe a client-side advertising auction, where advertisers compete for the right to 

20 display a banner advert to a user, based on local information about the user's preferences. 

21 4. Applications: Very Brief Overviewln the next section of the SDI description we 

22 describe some specific variations and systems that can be implemented within the general 

23 architecture. In overview, we describe the following key applications: 
24 

25 o Safe user profiling and personalization. This allows on-line users to receive 

26 personalized information and services without providing personal information to 

27 vendors, so that users retain control over their personal information. Users can 

28 interact with vendors under different pseudonyms, and provide information to the 

29 central SDI data warehouse to allow data mining. Finally, users can allow specific 

30 vendors to execute queries, where the result of the query is information that 

3 1 allows that vendor to customize its service. 
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1 o Client-side user profiling. The client-side SDI proxy can monitor the browsing 

2 behavior of a user, and submit data periodically to the central SDI data warehouse 

3 with appropriate meta information to provide the user with guarantees about the 

4 type of information that can be released to vendors during query execution. 

5 o Static Data mining applications. Agents can submit queries to the data warehouse 

6 query-execution module and perform data mining and collaborative filtering on 

7 aggregated and anonymous information provided by data submitted to the 

8 warehouse by agents. 

9 o Interactive data mining applications. Agents can request actions from the SDI 

10 system if certain conditions are found to exist in information, for example SDI 

1 1 can send information to other agents about services or products, if good matches 

12 are found. Another example is a request that a certain number of agents with 

1 3 particular properties be contacted and asked to take a particular action, we give an 

14 example within a transportation domain. 

15 o Pro-active data mining. The system of SDI might itself pro-actively execute data 

16 mining queries, and index and classify certain types of data to allow more 

17 efficient future query execution, and to also suggest useful information to its 

1 8 client agents. The answers to popular queries can be priced and cached, so that 

1 9 providing agents receive value whenever an answer is sold to another agent. The 

20 system of SDI might advertise a set of queries to allow vendors to select pre- 

21 computed results. 

22 o Matchmaking applications. We described in the top-level description of SDI a 

23 technique to implement "persistent queries", which reside on the central data 

24 warehouse's query execution module, and are triggered whenever the correct 

25 conditions exist in the data base. One typical use of such a query is to say 

26 "introduce me to other users with property P" such that the system introduces 

27 user A_l with user A_2 if A_l has the property required by A_2 and also A_2 has 

28 the property required by A_l. This is equivalent to "introduction by mutual 

29 consent", and is possible within SDI without information leakage, because the 

30 only agents that are informed of a match are the agents with the correct properties. 

3 1 Applications exist to finding a business partner, funding a new start-up (incubator 
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1 Co.), forming an interest group, n-way negotiation, introducer system (by mutual 

2 consent). 

3 13.1 . Static Data Mining ApplicationsThis section describes specific applications 

4 of SDI-based static data mining. A central example is collaborative filtering and 

5 personalization applications in electronic commerce, where consumers and vendors 

6 provide information to SDI, and the information can be queried within the price and data- 

7 access rules placed with the data by owners of the information. We refer to the variation 

8 of SDI with consumer profile information as the "iamworthit" system, because users can 

9 place profile information in a shared database and receive payments in return for queries 

10 performed by vendors. Vendors can use the profile information to build better 

11 customization models, and provide customized products to customers based on their 

12 profiles and what has worked with other customers with a similar profile. 
13 

1 4 We limit our attention in this section to "static" datamining, which as defined in the mid- 

1 5 level SDI description allows agents to query the data, but without contacting the agents to request more 

1 6 information. All queries are performed on the basis of the information already submitted to the database, 

1 7 and the rules associated with that information. Within the set of static queries, we do allow a vendor to 

1 8 identify a specific agent within a query command, so that if a user provides an identifier to a vendor then 

19 the vendor can query the database with its collaborative filtering model and determine an appropriate action 

20 to take. It is possible to request an action as the result of a query, for example "send message X to all users 

2 1 with profile information Y", so long as this message is not contingent on as yet unknown information about 

22 the user. We describe specific examples, for example to a smartbrowsing system, an education portal, and 

23 an advertising network. One of the core purposes of SDI is to provide a common location and format for 

24 information that has been gathered from a wide variety of sources and that might require different sorts of 

25 analysis. Since its framework is designed to handle different types of data and algorithms, SDI can be used 

26 as a platform to explore and exploit the rich connections that potentially exist within and across the 

27 databases of different vendors and customers. The system is designed to allow vendors to execute queries 

28 over profile information provided by multiple agents, and ensure that all queries are consistent with the 

29 policies outline by agents as information is first submitted. The central SDI server can also support 

30 cross- vendor and single-vendor personalization tools, such as multi-attribute 

31 collaborative filtering techniques. The queries can be executed to enhance a vendor's 

32 model, without providing the raw data to a vendor. A vendor's model can be enhanced 

33 without explicitly revealing any information about user profiles. The SDI server ensures 

34 the integrity of data, and prevents data being used for unauthorized purposes. A query 
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1 will receive access to data as permitted by certificates presented by the querying agent, 

2 and to the extent that the querying agent is willing to make payments defined in the price- 
's access rules associated with the data. As discussed in the top-level SDI description, the 

4 query-execution module implements an internal market with the data, and executes an 

5 agent (i.e. a vendor's) query as cheaply and efficiently as possible. 

6 13.1.1 Statistical Techniques for Multi-dataset Collaborative Filtering This section 

7 describes a statistical method for cross vendor and cross data-set collaborative filtering. The 

8 example considers particular types of data and analytical methods and suggests forms of 

9 validation that can be made available within SDL 

10 In describing this system, we show how the Secure Data Interchange architecture can 

11 integrate the architecture issued U.S. Patent No. 5,754,939 "System for Customized 

12 Identification of Desirable Objects" into a system for secure data exchange between 

1 3 multiple parties. The aforementioned patent teaches a method for profiling objects and users 

14 over a bi-directional distributed network, such as: an ISP, multiple ISP networks, a Web 

1 5 hosting network, or server software (such as data mining or recommender software) that is 

1 6 linked to a coalition of sites (such as a portal or Internet mall). 

17 The current invention, the system of Secure Data Interchange, allows correlations to 

18 be identified between vendor's data sets, that allows accurate profiling through the 

19 application of statistical methods, without providing vendors with explicit access to the 

20 profiles of users - because profiles are provided in anonymized and randomized forms. 

21 There are less efficient methods that can be used to identify correlations, for example using 

22 customer demographics, and vendor categories, to suggest which vendors might be well 

23 placed to form dynamic syndication relationships. With SDI it is possible to leverage as 

24 many data sources as are available, about users and the target objects with which they 

25 interact. In fact the degree of the measure of improvement in predicting user behavior (or 

26 increasing click through) is approximately in direct proportion to the square root of the 

27 number of user profiles and target profile interest summaries which are known. The 

28 emphasis in the aforementioned patent is on the bilateral relationships between vendors and 

29 users, and the architecture is not designed to support secure and privacy-protected data 

30 interchange and analysis across the user bases of different vendors. 
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1 In the system for SDI we push control of the profile for each user to the client software that 

2 runs on the machine local to the user, and provide for personalization through dynamic 

3 processing of information on the user's client machine. Similarly, we enable vendors to 

4 exchange data sets only to the degree that is mandated by users, and provide technical 

5 solutions to enable significant leverage of data while maintaining user privacy. 

6 The supporting architecture as stated in the above referenced patent also allows for 

7 profiling statistics to be collected and processed in a distributed manner. In the present 

8 invention the profile generation capabilities can be implemented at various levels, depending 

9 on where profile information is physically located. As described earlier, the central SDI data 

10 warehouse can be nothing more than a 'Virtual" database, with multiple links to data that 

1 1 physically resides on client machines that belong to users and vendors. 

12 To enable useful cross-vendor profiling, vendors can submit web pages that are 

1 3 tagged with profiles of target objects, user quality ratings based upon overall quality as well 

14 as other criteria (e.g., value, price, entertaining, informative graphic/visual appeal, etc.), 

15 location data (for target objects representing physical or geographical items), etc. User 

1 6 information, in addition to profiles, can include data mining and trend analysis statistics, and 

1 7 user provided ratings for target objects. 

1 8 As previously described in this patent, various conditions can be placed on the way 

1 9 in which a set of data may be used (i.e., can the user make a personal copy of the dataset?), 

20 as well as on the privacy controls put in place. It might well be that a vendor or a user is 

21 willing to share only a portion of his database, or that he will release only randomized data 

22 in accordance with the level of privacy he has guaranteed his customers. Although such 

23 restrictions could impact the content of the data analyzed by a vendor, as long as it is kept in 

24 an SDI-compliant format it can be analyzed by SDI's suite of tools. 

25 The data that is stored in the central SDI server has tight usage restrictions. For example, the 

26 user will have specified (via a price-policy) that data can only be used by querying agents 

27 that are able to present certain types of certificates, only anonymously, etc. The central data 

28 warehouse maximizes the value of data by analyzing conditions and supporting queries with 

29 the cheapest possible price rules. 
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1 The information that is stored in the central SDI data warehouse can be provided by users, 

2 via their SDI client-level proxy agents, and relating to their own individual purchasing and 

3 buying habits. Alternatively, the data might be provided by vendors, and relate to what 

4 information that vendor decides to release about its customer base — and in agreement with 

5 stated practices that the vendor follows about the information that it collects on its 

6 customers. 

7 It is not necessary for data records to physically reside on the central SDI data warehouse. A 

8 vendor may provide links to data, and use a locally secured query-execution module to 

9 carefully control queries of the data. For our current purposes we assume a uniform data 

1 0 access model, where although both data and algorithms might reside either at vendors' home 

1 1 locations or within the SDI system itself, the general analysis works transparently across 

12 these boundaries. 

13 For security reasons, the contents of databases may be injected with a small amount of 

14 noise. This prevents database users from surreptitiously connecting database records to 

1 5 individual customers, yet maintains the quality of inferences made about the database in 

16 general. This is described earlier, as noise-perturbed data release. Although such "noisy" 

17 records don't pose too much of a problem for those methods that make generalized 

1 8 inferences, it should be noted that recommendations made for individual customer vectors 

19 that have undergone such randomization will be less useful, since predictions are being 

20 made for a noisy target. 

21 A final consideration is the reduction of the data vectors' dimensionality (which can be 

22 extremely high), since it is harder to make clean inferences about sparse data. There are 

23 many standard methods that can be used to achieve this, such as Principal Components 

24 Analysis. Another approach is to adjust the granularity of the data, if at all possible. In a 

25 music store analysis, for example, there might be many more album titles that artists (since 

26 each artist can produce multiple albums). In such a case, purchases could be recorded by 

27 artist rather than by album, greatly reducing the dimension of the customer vectors' 

28 purchase space. 
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1 13,1.1.1 Data Structure In this application there are many types of information which 

2 can characterize both users and items. SDI is intended to function as the intermediary 

3 between a vast web of vendors, on the one hand, and individual consumers, on the other 

4 hand. Major sources of data include: 

5 1 ) Demographic. Such data will most likely be elicited by SDI from vendors and 

6 consumers when they initially register for the service, and details very general 

7 characteristics about them. It will consist of numbers and categorical values (age, zip 

8 code, sex, level of education, etc.). 

9 2) Commercial. This is the kind of data any that vendor collects in the course of doing 

1 0 business (especially e-commerce); generally, it links customer codes to purchase 

1 1 items, dates, quantities, and prices. Depending on the nature of the business, this data 

12 could be fairly complex, and might well include text. For example, one could imagine 

1 3 that a bookstore, in addition to keeping track of its sales history, collects book 

14 reviews, author profiles, and plot summaries. 

15 3) Behavioral (vis-a-vis the Internet). A user's client-side SDI proxy can monitor 

16 his/her browsing behavior on the World Wide Web, monitoring the pages a user hits, the 

1 7 click stream and content requested, etc. Click stream information can be useful, for example, 

1 8 because it can indicate a user's interest in the information that it is presented with. 

19 We assume that vendors and/or a third-party annotate web pages with tags, that 

20 provide a commentary of a web page and allow meaning to derived from a user's browsing 

21 behavior. 

22 The mere presence of such tags allows for correlations to be drawn between 

23 different web pages (e.g., a common Extensible Meta Language (XML) tag used by travel- 

24 related sites), because it implies similarity. Furthermore, it is conceivable that such tags 

25 could encode more refined measures of a web page's content, such as browsers' evaluations 

26 of its value. For example, a web page of interest to scale modelers, in addition to having 

27 images and text related to model trains, might have an XML tag that shows that other scale 

28 modelers have given the web site a "five-star" rating. This page should therefore be given a 

29 greater weight when SDI is used to create correlations of interest to model hobbyists. 
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1 In what follows, we assume that SDI contains data of the following types: 

2 1) Numerical (e.g. an age, price, or period of time). 

3 2) Categorical (e.g. a color or musical genre). 

4 3) Text. 

5 We describe a mathematical model for computing useful collaborative filtering 

6 queries across heterogeneous data records, such as the data stored within the central SDI 

7 data warehouse. Throughout, we present only the basic mathematical model. In 

8 implementation the query would be formulated in a general-purpose language, for example 

9 Java, and executed in the central SDI data warehouse subject to the conditions on access to 

1 0 data as expressing users price-access rules. 

11 13.1.1.2 Computing Correlations Between Data Points A common task for SDI is to 

12 compare and correlate different customers, which might well be represented by mixed 

13 collections of numbers, categories, and blocks of text. This is handled by treating each 

14 customer q as a vector in a space whose coordinates correspond to the fields of data 

1 5 available. In the following description we refer to a customer, but when a user interacts with 

16 a vendor under a pseudonym, the profile information will only relate to information 

1 7 provided to the central SDI server for that pseudonym. 

18 If there are m numerical pieces of data available, there will be n corresponding 



n 

1 9 coordinates in the data space, (xi , . . . , x m ). 

20 For each category i, there will be a corresponding number of values, ni. Hence, for a color 

21 category {red, white, blue}, rw = 3. Since each value is assigned its own coordinate, 

22 category i is represented as an nj dimensional vector, y\. Hence, the total number of 

23 dimensions used to describe the full set of n categories (yi, . . ., y n ) is 

24 Note that sparse methods are especially useful here, since a categorical vector yi will 

25 typically consist of mostly zeroes, with a single non-zero coordinate representing the 

26 categories' value (i.e., we encode the color red, using the previous example, as (1,0,0) ). 

27 Note also that category vectors with different values are treated as orthogonal by the system. 
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1 A final issue is the representation of text. As described in previous related patents, 

2 all relevant blocks of text in the database are converted into a dictionary that maps unique 

3 strings to the number of times they appear in the database. An appropriate TF/IDF weighting 

4 function is chosen and calculated for each of the p words that appear in the dictionary. The 

5 full set of text connected to a single customer can thus be represented as the vector (z !? 

6 z p ), where each zi equals the number of times the word i appears in text related to the 

7 particular customer multiplied by the TF/IDF score assigned to word i. 

8 In summary, when a database describes its customers using a combination of 

9 numerical values, categories, and text, customer i can be represented by the vector Cj=(xi, 

10 ...,x n , yi, y m , zi, Zp). 

11 13.1.1.3 An Example Profile VectorSuppose we have a database containing information on 

12 customers' ages, their musical preferences (i.e. an answer to a survey asking: "Which do 

13 you prefer, Mozart or the Beatles?"), and the contents of the emails they've written. 

14 Furthermore, suppose the only salient variables in all the emails written consist of the words 



TF/IDF(x) = 




1 5 "Beatles", "Mozart", and "practice", and that we are using the function 

1 6 Where nx represents the number of times word x appears in the dictionary. We now 

17 want to represent one of the customers in the database; he's a 10-year-old boy who prefers 

18 Mozart to the Beatles, and who wrote an email to his friend that mostly describes his 

1 9 attempts at practicing Mozart, but in passing mentions his sister's new Beatles CD. Suppose 

20 he uses the word Mozart 2 times (although it appears 456 times in the full database of all 

21 customers' emails), the word Beatles 1 time (appears 217 times in database), and the word 

22 practice 3 times (appears 77 times in database). 

23 We define the following coordinates: 

24 X!=age=10 

25 yi = {Mozart, Beatles) = (1 ,0) 

26 zi = # of times customer uses word "Beatles" x TF/IDF("Beatles") = 1 * 0.067 = 0.067 
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1 z 2 = # of times customer uses word "Mozart" x TF/IDF("Mozart") = 2 * 0.047 - 0.094 

2 z 3 - # of times customer uses word "practice" x TF/IDF("practice") = 3 * 0.1 14 = 0.342 

3 In our example, then, we might encode this boy as customer 1: 

4 c, = (x,, y,, z,, z 2 , z 3 ) = (10, 1, 0, 0.067, 0.094, 0.342) 

5 13. 1 . 1 .4 Choosing an Appropriate Level of Data Granularity We define the term granularity 

6 to denote the level of detail available within a given set of data, which is often structured 

7 hierarchically. Suppose a grocery store database contains records for a box of flavored 

8 gelatin powder. This could be categorized in a variety of ways; moving from the most 

9 specific to the most general, we might treat this data point as "12.5 ounce, strawberry flavor, 

10 Jello-brand gelatin dessert" (which would be entirely different from "12.5 ounce, banana 

1 1 flavor, Jello-brand gelatin dessert"), or as "12.5 ounce Jello gelatin" (a categorization which 

12 would treat as identical the strawberry and banana Jellos), or as "flavored gelatin", or as 

1 3 "dessert", or as "food", or as "grocery". 

14 When analysis is performed on such data, the level of granularity chosen will have a 

15 strong effect on the outcome of the analysis. If the level of granularity is too fine-grained, 

16 the data will be too sparse, although it could be potentially aggregated to the next highest 

17 level of granularity. If the granularity is too coarse, the results of the analysis might be 

18 overly general (e.g., a customer would find a collaborative filter useless if the only 

1 9 recommendation it makes for a dessert choice is "go to the grocery section of the store"). 

20 Since the level of granularity will have a salient effect on the outcome of an analysis, 

21 it should be chosen very carefully, and might well play a factor in pricing when a vendor 

22 chooses to sell its data. 

23 13.1.1.5 Statistical Methods for Data Analysis In order to perform a wide range of 

24 analytical tasks, SDI needs to make use of a variety of computational approaches. These are 

25 described below, starting with the simplest methods first. 

26 o (1) Standard Database Searches 

27 Since most of the data will be stored in centralized databases, simple searches, queries, 

28 and data filters can be implemented by means of standard SQL commands. Typically, 
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1 data will be collected or sorted using efficient database calls before being fed through 

2 analysis routines; once complete, the results can be fed back out to the database 

3 environment for further efficient manipulation. 

4 o (2) Metrics - Measuring the Similarity Between Profile Vectors 

5 Given two customer (or vendor) profiles, q and cj, it is frequently desirable to know how 

6 similar they are. For this purpose, we define the similarity metric M(Cj, Cj) to be a 

7 function that takes as input two customer vectors and returns as output a numerical value 

8 in the range [0,1]. When two customers q and q are identical, Mfe, Cj)=l; when they're 

9 completely different, M(ci, cj)=0. 

10 The problem is somewhat simplified by the fact that we treat all customers as vectors. 

W)=COS0= H1 

1 1 Given two customer vectors, we can use the correlation between them to serve as our 

1 2 metric: 

1 3 Note that 0 here represents the angle between the vectors A and B, and that we expect 

1 4 all coordinates of the vectors to be positive (in order for M(A,B) to keep its output in the 

15 range [0,1]). 

16 In more complicated cases, however, a customer vector might contain multiple fields 

1 7 with varying ranges of values. For example, we might have customer vectors of the form 

18 Cj=(agei, incomeO, in which the maximum age is 80, but the maximum income is 

19 300,000. In such cases, the coordinates with larger values will dominate the similarity 

20 metric, overwhelming any influence that smaller fields might have. 

21 This requires a normalization of the customer vectors, which can be done in several 

22 different ways. One approach would be to scale every coordinate by the maximum 

23 observed value, forcing all coordinates to lie between 0 and 1 (again, enforcing the 

24 rule that all coordinates must be positive). 
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C; - 



^ a S e i income i 



v 



max(age) ' max(income)j 
1 

2 The only problem with this is that if a coordinate's maximum value is an outlier (being 

3 vastly bigger than the typical value), most of the coordinates' values will seem unusually 

4 small once they are scaled by the maximum. In such cases, it might be better to scale the 

5 values with a "squashing" function such as the sigmoid, which deadens the impact of 

6 extreme values; one such configuration would be the following: 

— age j - mean(age) 
age i 

age 

income} - mean(income) 



incomej - 

( 



® "income 



g age t ^income, 



l+e age > \ + e' 



income, 

J 



1 

8 Note that the mean and variance of the data points are used to fully normalize them, 

9 such that the sigmoid function will spread the values somewhat more evenly between 

1 0 zero and one. 

1 1 The previous approaches are especially useful for single numerical fields, which might 

1 2 well overwhelm each other if some sort of normalization isn't performed. 

13 A different problem arises for text or large categorical fields, since they can potentially 

14 consist of hundreds of coordinates capable of overwhelming the influence of single 

1 5 numerical fields. Suppose we believe the age of a customer is as important as the text of 

16 articles read. In such a situation, the thousands of coordinates devoted to the text field 

17 would dominate the metric' s behavior, negating any influence that age would have on 

1 8 our measure of similarity - clearly not a good situation. 

19 A solution to this would be to find the correlations among the fields taken separately, 

20 then average the result. That is, if each customer q = (age i? texti), where texti is a vector 

2 1 with a very high number of dimensions, we could define the metric: 
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corr(age j , age j ) + corr{text i , textj ) 



Where 



corr(c n Cj) = 
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i j 
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The result is a metric that gives equal influence to each field. 
(3) Forming Vectors Into Groups 

The process of classification is essential to collaborative filtering, as it allows different 
vectors to be formed into groups based on some measure of similarity. If we are able to 
create groups of customer vectors, for example, we can then give individual customers 
recommendations based on the patterns of their group-mates, who presumably have 
similar tastes. 

K-means Clustering and Nearest Neighbor algorithms are extremely useful for grouping 
purposes: previous iReactor patents give a full and detailed description of our 
customized versions. This section gives a brief overview of these methods. 

(3.1) Clustering 

K-means Clustering is an algorithm used to partition a coordinate space such that all 
vectors in a given partition are more similar to that partition's vector average (the 
centroid), than to the centroids of any other partition. It is a process that iterates over the 
following steps: 

0. "Seed" the coordinate space with the initial centroids, which are vectors used to 
describe the centers of the clusters, in the sense that they are the average of all the 
vectors currently assigned to the partition. This can be done randomly (assigning 
centroids random coordinates) if no other information is available, or it can be guided by 
pre-existing information. For example, if we wish to cluster vectors of music customers, 
we can use information about musical genres to create initial partitions that correspond 
to pop, gospel, classical, etc. This will locate the centroids in well-spaced intervals 
across the coordinate space. 
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1 1 . Assign vectors to the most similar centroids. This is done for each vector by scanning 

2 across all centroids and calculating similarity M(vector, centroidj ); once finished, the 

3 vector is assigned to the cluster whose centroid has the greatest similarity. In this stage, 

4 vectors may switch their allegiance from one centroid to another, if the relative distances 

5 to the vector have changed sufficiently since the previous iteration. If no vectors change 

6 their allegiance, the iteration process is complete, and the algorithm stops. 

7 2. If the iteration is not complete, recalculate the centroids by setting them equal to the 

8 average of those vectors that have been assigned to them. Go back to step 1 . 

9 Once the algorithm converges, the vectors are grouped into clusters. The centroids' 

10 coordinates as well as the identity of cluster members is useful information that can be 

1 1 passed on to subsequent stages of analysis. 

1 2 (3 .2) Nearest Neighbor 

13 The nearest neighbor algorithm, simply stated, creates a list of those vectors in a 

14 database that most resemble a particular target vector. This is accomplished by 

15 comparing the target vector, in turn, to every other vector in the database; the 

16 similarity between them is recorded, and once the comparison loop is complete the 

1 7 list of similarities is sorted. The top k members of this list are returned as representing 

1 8 those k vectors which most resemble the target. 
19 

20 o (4) Generalizing Across Databases 

2 1 One of the most useful aspects of SDI is that it allows for inferences to be drawn across 

22 different databases through underlying connections in membership or content. An 

23 especially strong link can be made between commercial databases if they have 

24 customers in common. However, for reasons of privacy, individual customers may 

25 choose to use different pseudonyms when dealing with different vendors. This might be 

26 preferred by the individuals, but it weakens the inferences that can be made between 

27 fields occurring in different databases. 
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1 The techniques chosen to infer correlations across different databases will depend on 

2 how many pseudonyms are shared in common. At one end of the spectrum, every 

3 customer uses a single pseudonym for all transactions, and makes an appearance in 

4 every database. At the opposite end of the spectrum, every customer uses a different 

5 pseudonym with every vendor, and may appear in only a single database. 

6 Case 1 : All customers use a single pseudonym, and appear in all databases considered. 

7 This is the simplest situation to handle. Since all customers appear in all the databases, 

8 the customer vectors' fields are essentially scattered across several locations, but can be 

9 easily reconstructed. For each customer, we define a new data vector that concatenates 

1 0 that customer's representation from across the different databases. 

1 1 Hence, if we are considering databases A, B, . . ., Z, and customer i appears in each one, 

12 we define a new vector a = (cai, c B i, ... , c Z i), where cm is customer i's vector in 

13 database A. We then proceed as usual, making inferences with these augmented 

1 4 customer vectors . 

15 Case 2: Most customers use a unique pseudonym, and frequently appear in different 

1 6 databases. 

17 In this situation, although we see some connections between the databases, many 

1 8 pseudonyms appear in only a single location. Using Bayesian techniques, however, we 

1 9 can still make predictions for customer vectors across databases. 

20 Suppose we have a set of databases, A, B, Z. Taking each database in turn, we 

21 cluster it using all available data. Thus, using every record in database A, we group 

22 A's customers into clusters 

23 Ai, A2, A n . Taking database B, we create clusters using all of B's information, 

24 creating customer clusters Bi, B2, . . ., B m , and so forth. 

25 Now, scan both databases for common pseudonyms (representing those customers who 

26 have interacted with both vendors under the same pseudonym) and create count 

27 variables wy to represent the number of pseudonyms that appear jointly in Ai and Bj. 
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1 We can now produce the probability that a pseudonym appearing in Aj will appear in 

2 Bj: 



/ | \ P\B:AA\ w. I total 
^A,) = L ~ 



P ( A <) f^wy/ total 

7 = 1 

toto/ = X X 

3 For example, if we have a database of airline ticket purchases and a database of 

4 restaurant visits, we can create clusters, in the first case, of customers who travel to 

5 similar destinations, and in the second case, of customers who eat at similar restaurants. 

6 Given that a particular customer belongs to a cluster of people who frequent Caribbean 

7 restaurants, we can infer which travel packages would most appeal to him based on the 

8 linking probabilities, as defined above. 

9 Multivariate Extensions: 

10 If we have a third database C, and there are a large number of pseudonyms common 

11 to A, B, C, the above probabilities can easily be extended. For example, knowing that 

12 a customer appears in Aj and Bj, we can calculate the linking probabilities to any C k : 
13 

u(r\A n\ V C *) ^ I total 

P \ A ^ B J) Y.w ijk Itotal 

p n m 

total = £ X X Wy 

k=\ i=i j=\ 

14 

1 5 Or, if there aren't many pseudonyms that span all three databases, the probability of 

1 6 Ck given that a pseudonym exists in A; and Bj could be approximated by: 
17 

^A^B^ p[c k \A)-l{c k \B) 

18 

1 9 Case 3: All customers use several pseudonyms, and none appear in different databases 
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1 In this situation, there are no common customer codes that can be used to create links 

2 across the databases. However, the mere fact that several databases have been brought 

3 together for analysis should imply that there are semantic commonalities in the data. 

4 Although each database contains different fields, it may be the case that those fields deal 

5 with related subjects. A human expert, knowledgeable in the content of the databases, 

6 the subtleties of the domain, and the overall goal of the analysis (e.g. the creation of 

7 recommendations), will be in a position to create a "common-information profile" that 

8 spans the databases. In essence, the common-information profile defines a format that 

9 allows vectors from different databases to share a common coordinate space. 

10 The idea is this: the expert designs a high-level vector format that embodies the content 

1 1 deemed important for the project goals. Next, for each database he develops a mapping 

12 that encodes the database's elements into the generic format. Finally, the desired 

1 3 analysis is performed on the full set of common-information profiles. 

14 Although the expert will have to create completely new fields for the common- 

1 5 information profile, certain types of data will map directly to the common-information 

16 format. In particular, if every database contains text (catalogued and counted, for 

1 7 TF/IDF purposes, by accompanying dictionaries), the union of the words will define the 

1 8 text coordinates of the new common-information profile. When word counts are being 

1 9 mapped from their original databases to the new vector, the original TF/IDF weightings 
- 20 may be used, or new TF/IDF weightings may be created (using a dictionary constructed 

2 1 from all the databases' text taken together). 

22 Once analysis has been performed, certain common-information profiles will be 

23 grouped together by their shared similarities, although the pseudonyms they represent 

24 may have been originally drawn from different databases. Such groups will represent 

25 links between different databases, and may be used for predictive purposes (see end 

26 of example). 

27 13*1 . 1 .6 Example of Cross-database Analysisln this example, suppose that the central SDI 

28 data warehouse contains data submitted by the following vendors: 

29 A. A travel agency keeps track of tickets sold, and vacation web pages browsed. 
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1 B. A bookstore keeps track of books sold, and stores an electronic version of the 

2 New York Times Review of Books. 

3 C. A sporting-goods and clothing shop, keeps track of purchase items sold (which 

4 includes magazines, for which electronic text exists). 

5 A certain airline wants to promote various vacation packages it has available, which include 

6 both European and Caribbean vacations, as well as singles and family packages. Although it 

7 has leased rights to databases A, B, and C, it turns out that no customer pseudonyms appear 

8 in more than one database at a time - in other words, there are no shared records. 

9 A vacation expert is hired to create a common-information profile. He creates the following 

10 information vector: (list of tropical countries, list of European countries, family score, list of 

1 1 sports, text) 

12 Note that the family score is a numerical value ranging from 1 (young singles) to 10 (many 

13 small children), and indicates what kind of person the customer is (a party-oriented student 

14 vs. a sedate father of three). 

1 5 The expert then creates the following mappings: 

16 A. Travel Agency. Link destinations of tickets sold to country fields (i.e., the number 

17 of trips to Germany by a customer would be placed in the Germany field of the 

18 common-information profile). Link sales of children's tickets, or requests for 

19 children's meals, to family score. Put web-page data into text field. 

20 B. Bookstore. Link travel books' text to country lists. For all books purchased by a 

21 customer, map text from book reviews into text field. 

22 C. Sporting-Goods store. Map warm-weather clothing (and swim gear) to tropical 

23 countries, ski gear to countries with skiing areas. Map sales of toys or children's 

24 clothing to high-value family scores, map revealing-bikini and student-discount 

25 sales to low- value family scores. Map text from magazines purchased by a 

26 customer to text field. 
27 

28 These mappings are then applied to each database, generating a full set of common 

29 information profiles. These are then clustered, forming groups that share commonalities. 
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1 The expert can now do several things with the results. First of all, he identifies the 

2 general "flavor" of each cluster (e.g., families with small children that enjoy winter, Europe, 

3 and skiing); the pseudonyms contained within each cluster can then be targeted for vacation 

4 packages suitable to their tastes. Secondly, the fact that pseudonyms from different 

5 databases have been clustered together allows the expert to plan cross-category marketing. If 

6 certain travel-book-buying parents have been grouped together with parents who bought 

7 their children swimsuits and scuba toys, it may be that they share a preference for family 

8 activities that take place in warm places or by the seashore. Hence, the book-users might be 

9 advertised various ocean-related sports goods appropriate for young families, and likewise 

10 the swimsuit-users might enjoy getting recommendations for travel books that describe 

1 1 tropical destinations that are especially fun for children. That is, if the goal is to cross- 

12 market items from A to customers in C, the most logical source of recommendations would 

1 3 be the people in A who have been grouped with the people in C. 

14 13.1 .1 .7 Methods for ValidationTo a large degree, the overall success of an SDI analysis is 

15 the relevance of the connections that are inferred from the data. It is often the case that a 

16 certain amount of validation is required to determine which analytical approaches are the 

1 7 most successful, given that the analyst has had to choose a particular combination from a 

18 wide range of algorithms, data sets, levels of granularity, and parameter settings. The 

19 process of validation measures the relative success of a given project, and is used to guide 

20 the analyst through further iterations of tuning and adjustment so as to optimize the final 

2 1 results of the analysis. 

22 There are two general approaches, not necessarily mutually exclusive, to validation: the first 

23 is fairly quantitative, the second relies more on human expertise and intuition. 

24 (la) Quantitative Approaches- Test Against a Validation Set 

25 Suggest the use of a human to infer the most relevant attributes influencing a particular 

26 output and the role of experimental design (as suggested) as a way of holding out part of 

27 the input data in order to validate the key relevancy of particular attributes. Principal 

28 components analysis is also implied in the last paragraph of 1 -1. 
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1 The goal of validation, in this context, is to measure how successfully SDI makes a 

2 prediction, most commonly a recommendation. Before a recommendation system can be 

3 used commercially (when it is exposed to actual customers), it is important to make sure 

4 that it is using the best possible combination of algorithms, input data, and parameter 

5 settings (e.g. TF/IDF tuning). If several different combinations are under consideration, 

6 there is a need to gauge the relative predictive accuracy of one approach over another. 

7 This can be accomplished by holding out part of the data set, training the 

8 recommendation system on the remainder, then evaluating the strength of the 

9 recommendations made for the hold-out set. 

1 0 Suppose we are testing two possible settings for a system that recommends music. We 

1 1 make a copy of the customer purchase records and remove a single purchase at random 

12 from each customer - this slightly reduced copy will serve as our training set. We then 

1 3 allow the two rival systems to recommend musical albums for each customer, based on 

14 the information in the training set alone. Typically, these recommendations will take the 

1 5 form of a list of items with corresponding numbers that indicate the strength of each 

16 recommendation. The relative performance of a set of recommendations can then be 

17 gauged by looping across each customer, noting whether or not the system 

1 8 recommended the item that had been held out, and if so adding it to a running total. The 

19 system with the highest total can thus be judged the most effective, since it most 

20 strongly recommended items that the customers did, in fact, end up purchasing. 

21 Because the result of this type of validation is a quantitative score, it is possible to 

22 automate the model selection process. Given a set of analytical approaches (each with its 

23 own array of parameter settings), it is possible to loop through the full parameter space 

24 (using a grid of evenly spaced numerical values, if needed, to reduce dimensionality), 

25 computing a validation score at each iteration. Those combinations of algorithms and 

26 parameter settings that demonstrate the best performance could be chosen as the top 

27 candidates for the final system configuration, since they do the best job at predicting 

28 customer behaviors. 



29 (lb) Quantitative Approaches - Dynamic Method 
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1 The problem with the hold-out approach to validation is that it isn't dynamic, since it 

2 doesn't reflect the impact that the recommendation system has on the customers once it 

3 is implemented, and may be based on data that doesn't contain current trends. After all, 

4 it is better to predict what the customer will buy rather than what the customer has 

5 bought in the past. 

6 A better approach is to run a controlled experiment against the actual customer base. 

7 First, the pool of customers is split at random into different segments. Next, each 

8 approach under consideration is used exclusively to make predictions for a given 

9 segment. Once the trial period is over, each system is given a score based on how 

1 0 valuable its recommendations turned out to be (this could be measured by total sales 

11 generated, for example, or by the number of times a customer made use of a 

1 2 recommendation). 

1 3 (2) Human Expert in the Loop 

14 Although quantitative methods can automate the validation process to some degree, at 

1 5 the beginning of many projects there is so much raw input data available and so many 

16 decisions that have to be made about the analytical approach that an automated 

1 7 process would have to test a prohibitive number of combinations of data, algorithms, 

18 and parameter settings to get optimal results. In such cases, it is useful to employ a 

19 human expert who understands the psychology and nature of the particular domain 

20 being analyzed. 

21 Such a person will have intuition about what is and isn't relevant for his domain. For 

22 example, a movie expert might be called in to work on a movie-recommendation 

23 system, for which an immense amount of input data is available. In choosing relevant 

24 fields for analysis, the expert's understanding of cinema would lead him to include the 

25 director's name and numbers of Oscars awarded, whereas the exact length (in minutes) 

26 of the movie would be, in his estimation, irrelevant and therefore excluded. 

27 Once the analysis is complete and recommendations have been made, the expert's 

28 opinion (based on a qualitative understanding of the domain) can be used to guide which 
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1 particular combination of settings, chosen from a list of candidates with detailed test 

2 outputs, should be used for the recommendation system. 

3 (3) Combined Human/Quantitative Method 

4 There is certainly no reason why both approaches couldn't be used in combination. 

5 Many data sets include fields that are extremely noisy or simply irrelevant to a given 

6 problem; a human expert can be employed to pare the data set down to a reasonable 

7 size and dimensionality, using his domain expertise to create a data model reasonable 

8 for the proposed analysis. Next, automated methods can be used to fine-tune the 

9 parameter settings and to choose which subsets of the input data are the most useful. 

10 Finally, the human analysts called back to qualitatively evaluate the results of the 

1 1 fine-tuning, making the decision to either start a new iteration of the analysis, or to 

12 certify that the process is complete and ready for commercial application. 

13 13.1.2 The Iamworthit System Iamworthit is an application of Secure Data 

14 Interchange to business-to-consumer (B2C) e-commerce, where individuals perform 

15 transactions on-line through interactions with the server machines of on-line vendors. 

1 6 Individuals interface with the servers of vendors via local user client machines, networked at 

1 7 present over the Internet, although this is not a necessary infrastructure for the invention. 

18 Iamworthit allows individuals to receive payments for the information that is 

19 collected by an SDI client proxy and stored in the central SDI data warehouse, for data 

20 mining purposes. Iamworthit also allows individuals to receive payments in one-to-one 

21 interactions with vendors in return for providing vendors with information that allows them 

22 to make an appropriate offer. Secure data interchange supports the useful exchange of 

23 information between agents without allowing vendors to collect and distribute information 

24 about users without the knowledge or permission of users, as is possible now via the system 



25 of cookies and affiliate networks, such as that operated by www.doubleclick.com . In the 

26 system of Iamworthit vendors benefit through well-targeted advertising (both push and 

27 pull), the ability to customize information and services (even to first-time customers), and 

28 access to a large database of information about buyer purchasing habits. Individuals still 

29 release profile information to the central SDI database for the purposes of controlled access 

30 by vendors, and also provide vendors with profile information during interactions and allow 
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1 vendors to execute queries on information and receive the benefits of personalization 

2 without directly accessing the information. Users can maintain multiple pseudonyms and 

3 profiles, but within SDI vendors can still access information across pseudonyms and use 

4 cross-web (broad) and single- vendor (deep) information to build robust models of buyer 

5 behavior. Buyers benefit through personalization with privacy, and financial rewards in 

6 return for releasing profile information. 

7 Within B2C e-commerce, secure data interchange can also support a system of 

8 time-of-purchase competition, which allows an individual to use the profile management 

9 capability of SDI to provide vendors with information about a user at the time of purchase, 

1 0 and allow competing vendors to offer the same product at a better price, or a better product 

1 1 (for the user) at a good price. With time of purchase small entrants to the marketplace can 

12 make counteroffers to users that are about to purchase a product or service from another 

13 vendor, and can compete in small parts of the marketplace without investing heavily in 

14 advertising and brand awareness. Time-of-purchase requests can be made by user clients to 

15 iamworthit, to request that iamworthit cascades purchase requests onto other vendors, 

16 collects responses, and then return them to user clients. This extension of SDI is discussed in 

17 Section 2.1. 

18 As a commercial strategy, one might make it a necessary condition of belonging to 

19 Iamworthit that client machines submit profile information to the SDI central data 

20 warehouse, so that vendors can perform useful data mining and then provide customized 

2 1 products and information to users, for example based on models of collaborative filtering. 

22 13.1.3.1 System Overview 

23 Client machines in Iamworthit implement an SDI proxy on-top of a user's regular 

24 web browser. The role of the proxy is: 

25 a) Profile and identity management as the individual interacts with server computers 

26 of on-line vendors. 

27 b) Collect and manage profile information, with information periodically submitted 

28 to the central SDI data warehouse. 

29 c) Release profile information to vendors according to profile management policies. 
30 
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1 In Iamworthit the client-side SDI proxy is configured by users to periodically push 

2 information collected about the user to the central SDI data server, i.e. click stream data, 

3 profile information, purchases made, information requested, etc. Profile information is 

4 associated with price-rules, as in the top-level description of SDI, and can also be adjusted 

5 with random perturbations to protect the identity of a user under multiple pseudonyms. 

6 The client periodically sends update messages to the database, for example with information 

7 about new activity (e.g. web browsing, purchases, adverts not selected, etc.) SDI allows the 

8 information to be used by vendors to build models that enable good personalization of 

9 products, services and adverts. 

1 0 Users might choose to randomize profile information to protect sensitive information 

1 1 and to prevent vendors linking users across different pseudonyms. Clients submit perturbed 

12 data points and the distribution that was used to generate the noise (see the section on 

1 3 random noise perturbation). 

14 13.1.3.2 Privacy-protected data mining 

1 5 Vendors can gain access to profile information in the central SDI data warehouse, as 



1 6 long as they meet the criteria of the agents that submit information on behalf of users, and 

1 7 pay the price of data access. The profile information can allow vendors to build complex 

18 models for personalization, negotiation, and advertising, based on information about 

1 9 purchases made by different types of users. 

20 Users submit profile information to a central database that can be used for profiling, without 

21 revealing their identities. Client-side SDI proxies track user activity on-line, across multiple 

22 sites, and submits data, randomized if necessary, and only according to a user's preferences 

23 to the central SDI data server. The data is useful for building models of buyers, for example 

24 purchasing patterns, for the purposes of personalization of information and adverts. Agents 

25 that submit information retain ownership of the data. 

26 Simple data mining queries include: 



27 (a) Compute the average income level of people purchasing camcorders. Suppose 

28 that a vendor has pseudonymous identities of its recent customers, and wants to 

29 compute their average salary. It does not know the salary of any of the customers, 

30 and individual customers will not release their salary to the vendor. However, if 
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1 some of those customers have provided information about their salary to the 

2 central SDI data warehouse then the vendor can compute the result to its query. 

3 Agents might associate price rules with information about their salary that allow a 

4 vendor to query that information so long as the identity of the agent is not 

5 revealed, i.e. so long as the salary is provided anonymously. We described earlier 

6 in the patent description how price rules can define different prices for different 

7 types of information access. 
8 

9 (b) Compute the total donation to a fund for bone cancer research. Individuals do not 

10 wish to release information about their specific charitable donations, but might be 

1 1 happy to release that information anonymously. Again, a query to compute an 

12 average donation can be formulated and executed in the central SDI data 

1 3 warehouse. 
14 

15 (c) Perform collaborative filtering across multiple fields. This is explained in more 

16 detail below, and is possible within SDI because a general query can be 

17 performed so long as the querying agent makes payments for data access in 

1 8 accordance with the price rules of agents which submit information to the shared 

19 database. Note also that randomly perturbed data does not prevent the 

20 computation of correlations between fields, so long as the random perturbation is 

2 1 "small" with respect to the value of the data record. 

22 13.1 .3.3 Importing Off-line Data into the Data warehouse 

23 The central SDI server can associate off-line information about a user with a 

24 user's on line pseudonymous profile, even though the central server does not know the 

25 user's pseudonym IDs. This can only be done with the user's consent, and may also 

26 involve appropriate compensation. Within the system of iamworthit we can credit users 

27 for both off-line and on-line information. 

28 Merging a marketing database with SDI user profiles can be useful both to initialize 

29 the database, for example when asking a user questions to generate an accurate user profile 

30 rapidly and efficiently. Off-line data can also add useful richness to on-line profiling 
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1 information, which may be largely contextual and low on details/factual information. For 

2 example, off-line data can include information such as whether a user owns a car, rents an 

3 apartment, has house insurance, life insurance etc. SDI can also extrapolate correlations to 

4 other user profiles, on the basis of common SDI-profiles, for example using statistical 

5 techniques. 

6 It is often the case that individual customers appear in some databases, but not in 

7 others. Under normal circumstances, an analyst working across different databases would be 

8 faced with a large number of incomplete customer records, each with gaps corresponding to 

9 the fields of the databases to which they don't belong. A solution to this problem is offered 

10 by SDI, which is capable of drawing correlations between different databases - this 

1 1 information can be used to generate predictions to fill in the gaps of incomplete customer 

12 records. The result is a full set of customer records that can be meaningfully sorted or 

1 3 filtered by any of the combined fields, and which can now be handled as a unified set of 

14 data, suitable for use by standard database analysis systems. 

15 In a typical example, SDI might be used to combine a demographic database, such 

16 as the one offered by the Econometrics Corporation, with a commercial database, such as 

17 the one offered by Claritas. The Econometrics database consists of 180 million different 

18 customer records, but at a fairly coarse-grained level of detail, consisting of such 

19 information as age, gender, family status, location (at the state, city, or zip code level), and 

20 personal income. In comparison, Claritas offers a smaller base of customers, but includes 

21 information of arguably higher quality, since it breaks customers down to the geocode (sub- 

22 neighborhood) level, and includes much more detailed information on personal spending 

23 habits across hundreds of different purchase categories. A logical reason to combine these 

24 databases would be to supplement information about customers in the vastly broader 

25 demographics dataset with particular predictions about their personal preferences and likely 

26 commercial spending habits. One could imagine using this augmented data set to support a 

27 web site that instantly customizes itself to new visitors' preferences. Since the number of 

28 records in the Econometrics database is equivalent to roughly 72% of the population of the 

29 United States, it is likely that most first-time visitors to the site will already have a 

30 "thumbnail sketch" in the system, and can thus be greeted with an page appropriately 

3 1 configured to their personal tastes. 
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1 The technical details of the combination process (which have been described 

2 elsewhere in the patent) to a large degree depend on the amount of overlap between the 

3 databases, that is, the number of customer records which are shared in common. 

4 Suppose the demographic databases' fields are coded (xl, . , xn), and the commercial 

5 databases' fields are coded (yl, ., yn). Suppose further that customers in set A appear only in 

6 the demographic database, customers in set B appear only in the commercial database, and 

7 customers in set C appear in both. 

8 The process of supplementing the fields of customers A depends completely on the 

9 derivation of the distribution f(yl, ., yn | xl, . , xn), which describes the correlation of fields 

10 in the commercial database on fields in the demographic database. As previously discussed 

1 1 in the patent, different techniques may be used to create this distribution, depending on the 

1 2 size and variety of C. 

13 As a concrete example, one could imagine that set C includes customers from rural 

14 areas. The demographic database would reveal that, although their incomes aren't huge 

1 5 relative to the national average, they tend to spend a lot of it (i.e. are active consumers), have 

1 6 large families, and purchase large vehicles. The commercial database might show that they 

17 enjoying hunting magazines and Ford trucks. If they live inland, they buy hunting 

1 8 equipment, if they live near the ocean, fishing equipment. 

19 If these trends are dominant in set C, they will impact the distribution function. 

20 Thus, when a browser from a small town in Texas with a typical income pattern visits the 

21 automated website, he could be greeted with discounts on truck accessories and a small 

22 sidebar with news on the hunting season. On the other hand, a visitor from a small town in 

23 Maine might be given the same truck discounts, but would have news on the fishing season. 

24 Although the demographic dataset is arguably the weaker of the two in terms of content, the 

25 fact that it contains even a small amount of information on most people in America makes it 

26 very valuable for handling first-time visitors, since most of them will appear in it. By using 

27 SDI to leverage the more detailed information in the commercial database, we are able to 

28 supplement the rough demographic data with predicted commercial preferences. This allows 

29 us to construct more detailed thumbnail sketches for each customer, allowing our reception 

30 of first-time visitors to be much more appropriate (since knowing personal hobbies or 

3 1 interests tells us much more about a person than general income level). 
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1 13.1.3.4 Static Query Execution: Central ApplicationsSome key applications of SDI are: 
2 

3 o Assessing the Value of Data. Plug together sets of data, and measure predictive 

4 accuracy. 
5 

6 o Matching Data Across Vendors. Find patterns in common pseudonyms, denoting 

7 common areas of interest; use catalogues of order codes and item description to 

8 find similarities across data sets. 
9 

10 o Targeted Recommendations; e.g. match customers to their nearest neighbors in a 

1 1 data set and generate recommendations for users, collaborative- filtering style 

12 application. 
13 

14 o Leveraging Portal Data. Use data from portal to leverage data needs for ISP 
15 

16 o Analyzing Affinities. Suppose a vendor has a list of customers, and knows to 

1 7 some degree what web pages they visited after leaving vendor site. A large 

1 8 collection of customers taken from an ISP will contain their web-surfing behavior. 

19 Cluster web sites and cluster customers, finding cluster-to-cluster interactions. 

20 Use this information to classify vendor's customers; gives vendor an edge in 

21 knowing customers' tastes. 
22 

23 There are a number of search-based applications, where SDI searches for appropriate 



24 profiles and then requests that SDI makes contact with the users pseudonymously, i.e. 

25 without the vendor receiving any useful information about a user's identity. The contact, 

26 interaction, and business relationship with the vendor occurs under terms of complete buyer 

27 pseudonymity. In accordance with the parent patent application [INSERT US PATENT 

28 No.] the pseudonymous communication may be either email, real-time text 

29 communications, voice (such as the pseudonymous telephony or Internet telephony). In the 

30 case of pseudonymous telephony, instead of a one-time or persistent pseudonymous buyer 
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1 address, pseudonymous buyer telephone numbers may be used for the third party to reach 

2 the buyer under his/her terms. Example applications include: 

3 (i) Financial Advice and Financial Planning Services. Often buyers are quite 

4 sensitive about the confidentiality of the release of this type of information 

5 related to personal financial matters and particularly with certain matters (and 

6 perhaps in general) prefer that their financial advisors were unaware of their 

7 true identities. Similarly, investment advice or sales communications by stock 

8 brokers are another application where similar buyer information is typically 

9 disclosed. 
10 

1 1 (ii) Insurance Agents & Brokers. For many types of insurance, (e.g. health, life, 

1 2 casualty) personally sensitive information is disclosed by buyers to their 

1 3 agents and brokers. Initially, before insurance services are purchased, it is 

14 possible that useful detailed quotes and/or insurance advice could be provided 

15 to a buyer pseudonymously. 
16 

1 7 (iii) Legal advisors. There are a variety of legal disciplines in which the associated 

1 8 legal services delve into highly sensitive personal information (e. g., 

19 bankruptcy law, divorce law, criminal law, etc.) Many lawyers also offer to 

20 first-time prospective clients a free consult in which such a privacy-enhanced 

21 communications system could be initially beneficial to the parties. 
22 

23 (iv) Family Counseling and Psychological Counseling. The parent patent 

24 application also suggests these applications which often involve the exchange 

25 of highly confidential personal information. 
26 

27 (v) Medical Consultations Involving Drug Prescriptions. In this variation, 

28 although medical consultation may be conducted anonymously, in order for a 

29 physician to prescribe medication, the identity of the patient must be known 

30 (within the current regulatory legal requirements). 
31 
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1 (vi) Advertising network (where the clustering or nearest neighbor algorithm 

2 interact with the ad server). Ads on that ad server's database, e.g., on the sites 

3 most frequently visited by the user are periodically uploaded to SDI such that 

4 the ad(s) of highest predicted interest to the user is presented upon the user 

5 visiting that site. Preferably the target object profiles of all the ads on the ad 

6 server database, as well as the profiling algorithm, which is used by the ad 

7 server is properly integrated with that of SDI so that it is possible to convert 

8 the usage statistics of the data model used by the ad network (which may 

9 include ad server data for users collected across the ad networks) into useful 
10 statistics for SDL 

11 

12 (vii) Content sites, e.g., for personalizing news articles (which are again uploaded 

1 3 to the SDI server each news day if those articles would not otherwise be 

1 4 available on that web site. If they are, as with any web content format which 

1 5 is universally accessible, the content may be frequently profiled following 

16 retrieval, e.g., by a simple web crawler script. 
17 

1 8 (viii) Portal Sites - As detailed in the parent case, the user profile may be presented 

19 to the information vendor server containing the portal site. The selection and 

20 prioritization of the most relevant personalized content categories may be 

2 1 automatically determined at the client level proxy, which is conveyed to the 

22 information vendor server in order to personalize every aspect of the portal 

23 interface. It should be noted that without the vendor's cooperation (with SDI) 

24 the interfaces of portal sites and content sites could be automatically modified 

25 at the client-level proxy in order to provide the personalization of selections to 

26 the user (from the user's perception as if the personalization was dynamically 

27 generated on the site itself). However, ad servers/ad networks could not be 

28 personalized as easily without privileged access to the ad database (unless the 

29 ads from that site were recorded and stored in the ISP-level proxy from other 

30 users visiting that site). 
31 
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1 (ix) Rating Sites. The ISP level proxy may also contain metadata relating to 

2 specific sites, products or informational content (as an alternative to the 

3 metadata residing on the HTML pages, which they describe through 

4 cooperation with the vendor). Such metadata includes annotations, average 

5 user ratings, according to a variety of attributes, as well as the ability to 

6 browse Web pages with associated annotations and/or user ratings provided 

7 from those users who are identified by either: a user-selected cluster, or the 

8 user's profile or high or low ratings according to site selected. Endorsements 

9 (and ratings by a variety of relevant criteria) by a variety of types of 

10 organizations may be available as well, and browsing and searching may be 

1 1 performed with these organizational endorsements/ratings as a criteria to bias 

12 or filter searches or filter the sites accessible via browsing interface (the same 

1 3 may be also performed for user based ratings as well). 
14 

1 5 (x) Cache Engines - As an alternative to the profile generation and processing 

1 6 modules running on the network vendor (ISP) servers, it may operate instead, 

17 or in combination as a distributed process upon the caching server in 

1 8 accordance with the method taught by "Broadcast System for Reduced 

19 Memory Devices and Asymmetric Networks", (check exact title of this 

20 patent) As taught in this patent (in accordance with the current applications) 

2 1 the pages predicted to be accessed by the user on the following day are 

22 precached in advance. 
23 

24 Typically sites which tend to be visited most frequently are assessed for 

25 selections which are personally relevant but have never been previously 

26 accessed. These would be presented as recommendations to the user if/when 

27 he/she visits that site (which is also probabilistically determined). The fact 

28 that the recommendation (whether generated by the site itself or modified at 

29 the client level proxy), the user's behavior is positively being reinforced by 

30 the fact that the ranking of personal recommendations is prioritized in direct 

3 1 relation with the prioritization scheme for precaching, i.e. personalized 
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1 recommendations and precached pages are substantially identical and as a 

2 result of precaching to the server, or even the client, can be accessed by the 

3 user with little or no latency (thus in the ideal embodiment it may even be 

4 advantageous to highlight the links which have been precached). 

5 In a preferred commercial model, the techniques which are above described, 

6 that meta data containing profile information on the pages are encrypted such 

7 that a cache engine that is used to precache cannot be decrypted and read by 

8 certain caching engines which do not possess a desired business relationship 

9 with iamworthit. In another implementation, other competitive intermediary 

10 services to iamworthit may not be able to decrypt and read these web page 

1 1 profiles in order to present personalized information to the user (even if the 

1 2 competing protocol were pursued, the accuracy of the iamworthit version 

13 would be substantially more accurate due to its ability to leverage vendor 

14 centric SDI data from the host level proxy. 
15 

1 6 (xi) Auto Insurance Application 

17 Co-pending patent application entitled "Applications for a Location Enhanced 

1 8 Information Architecture" describes a location-enhanced framework by which statistical 

19 methods are used in order to very efficiently and confidently extrapolate the most 

20 relevant attributes in predicting automobile accidents (or the avoidance thereof). The 

21 correlations from some of the existing metrics used may be refined using this technique 

22 e.g. LEIA is able to accurately determine the number of miles a buyer drives per week 

23 while the buyer will often lie about this, thus the basic model may be refined and more 

24 accurate information may be provided on a per-buyer basis. The scheme also enables 

25 completely new metrics to also be identified and utilized as well which may correlate the 

26 attributes location with time. It is conceivable that if a buyer provides access to this 

27 location-enhanced information by an insurer, that the insurer could in turn offer 

28 premiums, discounts or deliver credit to the buyer which could be added to monetary 

29 credit the buyer receives for personal information from iamworthit, for example, an 

30 iamworthit implementation which uses LEIA to profile and target buyers with ads by 

3 1 their location (e.g. while riding in an automobile). 
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1 

2 (xii) Hospital example. 

3 In one data application of the client level proxy server, the user profile includes medical 

4 data which is obtained from medical records, (such as from hospitals or physician's 

5 medical records or potentially that of a health insurer). Typically, various physician's 

6 offices and hospitals which a patient (hereinafter ''user") has visited over the years 

7 contains separate portions of a user's overall medical history, thus these various sources 

8 may be combined upon the user's request by downloading this data to the client-level 

9 proxy (or preferably, the user enters into a contract with those organizations in which all 

1 0 medical data and updates thereof are downloaded by the organization and/or an "agent" 

11 to the organization which transmits a request which is digitally signed by the user at the 

12 client-level proxy server. The origin of the request (the user) is authenticated and may 

1 3 be processed by a human or another agent located at the organization's host computer. 

14 Because of the highly sensitive nature of medical data, there are potential user privacy 

15 advantages in using randomized aggregates. For example, a user's age, medical history 

16 of specific relatives (particulars of which could be more generalized) genetic data, 

1 7 numeric values associated with various medical tests, results for which are a numeric 

1 8 value. This data may be of relevance to pharmaceutical companies, alternative medicine 

19 vendor and clinics insurance companies hospitals physicians, clinics and home health 

20 care providers, the latter three of which may wish to advertise to patient prospects and 

21 extend their medical practices. The privacy architecture herein provided is a critical 

22 component for enabling access to user data by these commercial entities and is perhaps 

23 critical to the extent that users must explicitly authorize the transfer of medical records 

24 from either the physical copying or electronic duplication of an associated transfer of 

25 such information to an intermediary (SDI) which is trusted by the user. 

26 (xiii) Medical information, such as medical conditions, medical history, active 

27 prescriptions, drug reactions, family history, possibly even genetic pre- 

28 dispositions (from a genetic profile). Medical insurance information may also be 

29 potentially useful for a prospective qualified accessory to be able to readily access 

30 in case of an emergency. 
31 
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1 (xiv) Physical location information — Users or advertisers could, for example: a) 

2 Query a pseudonymous user database to access profiles that are in close physical 

3 proximity and match certain criteria, e.g. live in a certain geographical region, had 

4 recently attended a meeting or event (or is planning to attend a particular event) 

5 had recently communicated with a friend or associate. In another variation, a user 

6 could for example, submit a query pertaining to every user in a particular physical 

7 space, e. g., a room, hotel or convention center, e. g., identify all users present 

8 here who attended Internet World, 1995. 

9 13.1.3.5 Buyer Infomediary 

1 0 The central SDI data warehouse can be used by vendors that provide services of 

1 1 buyer infomediaries, for example providing buyers in B2C e-commerce applications with 

12 historical information about previous purchases of users. At present companies such as 

13 www.priceline.com make profits because many individuals post take-it-or-leave-it buy 

1 4 offers for goods that are above the reservation price that vendors are prepared to accept for a 

1 5 service. An information infomediary, based around information submitted by users about 

1 6 previous successful or unsuccessful bids would provide for a more efficient marketplace. 

1 7 A buyer infomediary system can be built as an application of SDI to providing bid prices for 

1 8 agents in electronic marketplaces which are agent-mediated and allow dynamic pricing, for 

19 example the nascent marketplaces of on-line auctions, www.ebay.com and on-line reverse 

20 auctions, www.priceli ne. com . 

21 13.1.3.4 Seller infomediary 

22 On the flip-side, the system of SDI can also be used to support third-party 

23 infomediaries for sellers, that make their business by performing data mining on the 

24 information in the shared data warehouse, and can sell that information onto other vendors. 

25 It is possible to identify useful pricing models, based on historic purchase information from 

26 many different individuals. 

27 The system of SDI can also be used as a confidential database for the purposes of 

28 generating statistics from sensitive data. For example, as a trusted system, manufacturers 

29 might be willing to provide information about their productivity, margins, retainment rates, 

30 production efficiencies, yields etc. The central SDI server could generate statistics, globally 
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1 for the manufacturing sector, and then individually for each manufacturer — as it relates to 

2 the information provided by other companies. Similarly, it would be possible to use such a 

3 system to compare salaries across different universities. While an individual university 

4 might be reluctant to reveal information about its pay-scales to other universities, in the 

5 aggregate this information is not sensitive — and a survey on salary can be useful to both 

6 employers and job candidates. SDI is used to securely calculate statistics, without revealing 

7 any information that might compromise the privacy of a single employer. 

8 13.1.3.5 Personalization Application. 

9 Personalizing information on-the-fly requires that a vendor has a data model, for 

10 example that clusters its current user-base according to what they are likely to be interested 

1 1 in. We can allow vendors to execute queries, for example clustering algorithms on profile 

12 information, that use all the information submitted to the data base by users without 

1 3 violating privacy policies of the owners of that information. This can be done by returning a 

14 classifier, that will place new profiles into relevant clusters, and provide information about 

15 the likely interests of a particular user. Given this classifier, we describe in Section 3, 

16 "client-side data mining", a method that allows the vendor to compute the results of a 

17 personalization algorithm on the profile of an agent without receiving that profile. Simpler 

1 8 methods might either run such a personalization query in the central SDI data server, or 

1 9 release some profile information to the vendor. 

20 Vendors are assured that all enabled users (who are subscribed to iamworthit) will 

21 receive highly personalized recommendations for their products, services and advertising 

22 access from their site, (and because many sites are migrating towards providing also a 

23 full-blown niche portal with content, chat and forums,- personalization of this 

24 information is enabled as well). In addition to leveraging click-through navigations and 

25 transactions captured from their own customers during present and previous visits to their 

26 site, iamworthit provides the unique advantage of being able to utilize the user's web- 

27 wide profile consisting of a variety of attribute types upon accessing the site in order to 

28 leverage statistical correlations captured by the integration of the statistical models of the 

29 vendor, with that of Iamworthit. If stored on the SDI main server, the iamworthit data 

30 captured at the client level or ISP-level proxies is typically concatenated with the data 
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1 sets of the vendor captured from the host- level proxy (maintained at the vendor's host 

2 server or if SDI enabled at the SDI main server) This is particularly important for first- 

3 time visitors to the vendors site and particularly within vendor domains in which content- 

4 based merchandise (e.g. books, magazines and news subscriptions) are sold. Assuming 

5 the number of iamworthit subscribers is substantial enough and assuming that the present 

6 business model involves a "relationship" between iamworthit and a vendor of 

7 recommender systems and/or (at an arms length) the vendor-centric SDI service, this 

8 advantage alone could impose significant pressure upon the vendor to deploy a 

9 recommender system(s) which interoperates with iamworthit (or subscriber to SDI). 

10 Content sites, news sites, and web portals, etc. can provide personalized information to 

1 1 users by subscription to the system of SDI, which provides access to customer profile 

12 information. 

13 13.1 .3.6 Example: Customized Web Pages. 

14 Analyze customers for broad preferences in choice of web pages visited (corporate, 

1 5 Star Trek fan, etc.). This defines the initial look and feel for the page that greets them at their 

16 portal (a teen might enjoy lots of bright colors and sound clips, an investor would prefer a 

17 more staid design); different "skins" could be created to match the major categories of 

18 customers, and would designate both the graphical design and modules available on the 

19 page (e.g., a working stock-ticker for an investor, a real-time weather map for a jogger). The 

20 web pages and information most frequently accessed by a customer would be given priority, 

21 and a hierarchy of usage could be developed. Since stock prices are of the highest 

22 importance to an investor, a ticker reflecting his portfolio value would stream across the top 

23 of the page. However, although he enjoys spending his profits on vacations and 

24 automobiles, these are only of secondary interest to him (as revealed by his on-line 

25 behavior), and so are relegated to a sub-menu on his web-page. As his usage changes, the 

26 priority level assigned to the modules would change as well, so that when a jogger 

27 purchases a treadmill for indoor running, his weather reports won't dominate the top-level 

28 screen. 

29 Small children could have simplified browsers, with extra-big buttons and access to 

30 pages pre- screened by a "web-nanny" service. 
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1 SDI would be used in the initial phases to group customers into general categories 

2 based on their patterns of their web surfing, and would be used in later phases to adjust the 

3 content and style of their portal home-pages (based on what similar customers seem to be 

4 enjoying). SDI can improve upon this rule engine-based model by giving far more 

5 information about the users who tend to perform certain actions on their sites. Moreover a 

6 second (outside) vendor which is syndicating products or advertising on the vendor's site 

7 could (upon farther approval of the hosting vendor) use data mining of the hosting vendor's 

8 site and based upon explicit actions of users on his/her site and/or the user's profile ascribe 

9 rules for more explicit criteria for targeting these remote prospects. Of course, response of 

10 users provides a means for iteratively refusing these rules explicitly (or alternatively 

1 1 implicitly). 

1 2 The Secure Data Interchange clearly offers significant potential value to a vendor 

13 licensing rules engine in many ways. As an additional example, many vendors also use a 

14 site traffic reporting tool in order to analyze traffic patterns through their sites. It is very 

1 5 helpful for vendors to analyze, for example, how often certain pages become clicked-on or 

16 transactions occur or conversions occur, how moving the link may affect (typically improve 

1 7 ) these patterns and how certain rules (which may dynamically create links or move links) 

1 8 affects these patterns. Of course, observing the movement patterns of users throughout the 

19 site helps the vendor better understand user's perception of the relatedness and natural 

20 progression of page accesses throughout the site. This is particularly so if different rules are 

21 experimented with such that optimal click-through or transactions occur particularly for key 

22 pages or purchasables. Adding to this statistical data SDI user profile information is able to 

23 reveal (via data mining) much more robust relationships between pages as these 

24 relationships change with user profile features. Accordingly appropriate rules may be 

25 provided for individuals based upon this general user profile information which is gleaned 

26 from interests and behavior before visiting the site, i.e., either from the user-centric or 

27 vendor-centric SDI (while the user visits the vendor's site or alternatively accesses an 

28 affiliate site). 

29 In another variation of SDI we can highlight content on web pages with information 

30 that might be relevant to a user, even when the source of the web page is not personalized. 
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1 This can be done via collaborative filtering techniques, which might bring in feedback and 

2 comments from other similar users within SDI that are stored in the central SDI database. 

3 The iamworthit (user side) SDI database in cooperation with the vendor centric SDI 

4 service can sell to the vendor centric SDI service or other industry or market research 

5 organizations strategic information about the comprehensive behavior activities and user 

6 profiles of visitors and customers of these vendors (as is suggested earlier in the spec). 

7 Additionally, targeted survey questions may be presented on behalf of these entities in 

8 order to extract further information which may be correlated with certain features and 

9 attributes of these users. One such method, rapid profiling is detailed in the parent 

1 0 application [INSERT US PATENT NO]. 

1 1 Vendors can utilize SDI's data mining interface to observe content and product 

12 consumption affinities based on user profiles describing Web wide behavior. This 

1 3 interface may also enable vendors to observe comparative click through and/or 

14 transaction rates of their competitors, how these criteria are affected by user profile 

1 5 attributes including geographic criteria (if relevant to that vendor) as well as how these 

1 6 criteria, (including even individual multi vendor customers) are affected by various types 

1 7 of recommender and rule based engines and further the particular rules and weighted 

1 8 feature correlations used in generating these recommendations (again as described above, 

1 9 these rules and feature relationships may be derived automatically through a core sample 

20 of users, who are iamworthit subscribers). A vendor can use statistical data to tune 

2 1 certain features and implement certain rules, either directly or with the help of a user 

22 centric SDI analyst. Moreover, direct real time response feed back may be collected on 

23 these manipulations to the data model in iterative fashion. 

24 13.1.3.7 Example: Ad networks. 

25 The system of SDI provides a method for an advertising network which allows 

26 users to receive personalized adverts that are relevant to their lives, and also allows 

27 vendors to boost profits by placing one-to-one adverts instead of the traditional broadcast 

28 nature of advertising in media such as television and newspapers. Each user can receive 

29 adverts that are tuned to his/her profile, but without losing control over the information in 

30 that profile. In fact, a user can configure price-rules to receive payment for receiving a 
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1 well-targeted advert. The trusted secure data interchange can operate as an "ad network", 

2 allowing for the placement of well-focused banner ads to market goods that are relevant 

3 to users of a particular content site. Electronic banner ads provide the potential for one- 

4 to-one marketing, when the advertising agency has information about the user that has 

5 just hit a site, together with information about what the user is doing local to a site. For 

6 example a car manufacturer is able to place a focused advertisement to a user that has just 

7 performed a search for new cars in a search engine, to a user that is known to have a large 

8 family and a high disposable income. 

9 Ads can be targeted to iamworthit users based upon their user profiles. Ad 

1 0 networks can use the profile information in SDI to target ads and reach their target 

1 1 customers. Ads can be targeted with statistical data across multiple vendors that are 

12 subscribed to SDI. Statistical data can also be used to determine estimated revenues for a 

13 vendor if she/he were to interoperate with iamworthit and subscribe to SDI. 

14 SDI might develop a decision tree to decide how to assign adverts to users with a 

15 particular profile. Within SDI we can track users across multiple domains, to allow fine 

1 6 level control over ads that the user sees. 

1 7 There are two possible business models. Firstly, an Internet content provider could 

18 purchase access to information placed by vendors and users within the Secure Data 

19 Interchange database. This information may be "rented" for a period of time, and then 

20 whenever a user visits the site of the content provider (possibly through the pseudonymous 

21 proxy server), the provider can query the data interchange for information about the user. 

22 The Internet content provider sells well-directed advertisements to vendors. Secondly, the 

23 data interchange could sell or rent data to an advertising agency directly, providing 

24 information in real-time to enable the advertising agency to provide more focus in its banner 

25 ads for its clients. "Per-transaction" pricing is a very powerful pricing model that is enabled 

26 with on-line banner ads. It is simple to monitor the number of click-through that are received 

27 at a particular banner, in response to an advertisement. In the off-line world pricing must be 

28 based on the number of impressions, or worst still, the number of mailings sent and it is 

29 more critical to understand the expected value of a campaign up front. 

30 The proxy server could also act as an "ad network" itself, and sell focused 

3 1 advertisements for vendors, and purchase ad-space on the sites of content providers. The on- 



Pagel62 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 line domain provides this unique opportunity for quick experimentation with advertising 

2 strategies in order to get feedback on the likely utility of untested approaches. The system 

3 can use a hierarchical cluster tree to identify the most revealing items in a dynamically 

4 responsive fashion such that the profiles of all of the selections can be generated with the 

5 most minimal amount of interactions with the user (see "Rapid Profiling" section in issued 

6 patent entitled "System & Method for Customized Electronic Identification of Desirable 

7 Objects). Thus a more robust statistical model across multiple vendors is established as a 

8 result of the user's click through response of these intelligently selected virtual banners as 

9 well as other pages which are subsequently navigated through once the remote site is 

1 0 accessed via the banner. 

11 In the preferred approach rapid profiling not only dynamically identifies and 

12 presents items which are most revealing of the other items in the collection, it also selects 

13 the users whose profiles suggest the greatest familiarity with these items (i.e., potentially 

14 correlated items). Furthermore, if the system's objective is to find new users or users who 

1 5 may be interested in the present vendor's other products, products for which little is known, 

16 then it will match users who are least familiar with exemplar items. The idea is to reveal the 

17 most significant data about the user profile with respect to the present collection of items of 

1 8 interest. Finally, rapid profiling can use direct explicit queries to determine interest on an 

1 9 item(s) or to collect demographic data on a user. 

20 The target object profiles of advertisements on the ad server are matched against the 

21 user profile in order to automatically present the most relevant recommendation(s). 

22 Typically, the client-side proxy requires the host-level proxy to disclose the target object 

23 profiles of the products/services sold by the vendor. This data is stored as meta-tags in 

24 XML form and is encrypted. This data can be very useful to the user in navigation, filtering 

25 and search activities in the future or in a variation the ISP - level proxy a party (a neutral 

26 server) could store these target object profiles and selectively disclose relevant pieces of 

27 them (e.g. genre cross-correlations) to vendors, which are considered according to the 

28 disclosing vendor's data disclosure policy acceptable to receive this data. These profiles 

29 are not accessible to the client-level proxy but may be disclosed only if there are restrictions 

30 within the vendor's data disclosure policy. 
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1 In another variation, if the data to be disclosed to the vendor is acceptable to the 

2 original vendor but she/he is untrusting of the vendor, the data is received by the host-level 

3 proxy (another neutral third party) instead of the vendor, thus providing the disclosing 

4 vendor with an additional level of security, assurance about the use of his/her data while 

5 enabling the users of such a site to access all of the merchandise or content in a completely 

6 personalized fashion. Thus these XML tags are stored in association with, but on a separate 

7 server from the actual HTML pages stored on the vendor's site. Additionally, these profiles 

8 are constantly updated by user profile data conveyed to the host-level server which operates 

9 in distributed fashion. 

10 Advertisers may also stipulate additional rules relating to the delivery of ads to 

1 1 buyers in conjunction with performance of certain explicit actions e.g. submission of certain 

12 search terms, visitation of pages with certain key words (as a general tendency or on a per- 

13 page basis), visitation of certain specific pages or sites, performance of certain location 

14 specific behavior the delivery of which could be performed in accordance with the real-time 

15 behavior by the user if desired (e.g. visitation to a competitor's retail outlet or traveling 

16 within the proximity of the vendor's own store), otherwise the presently suggested 

1 7 algorithms may automate most of these manual functions by virtue of its recommendation 

18 capability e.g. matching ads to the most appropriate list of buyers or matching ads to 

1 9 metrically similar Web sites which may appear whenever the buyer visits that site(s). 

20 Ads may also take the form of digital coupons [KRJM 98] which are priced by 

21 vendors according to behavioral information which the buyer is willing to disclose to this 

22 vendor. Because many buyers are adverse to viewing additional advertising which is 

23 "imposed" in the preferred embodiment, any inserted ads delivered by iamworthit can be 

24 'turned off' by the buyer voluntarily unless the in kind value which the buyer receives in 

25 exchange for the advertising mandates a certain minimum ad impression delivery. Also in 

26 light of the above buyer concerns in the preferred embodiment, there are no additional 

27 impressions added to the buyer's web browsing experience. 

28 Ad blocking technology can be deployed to block existing banners and replace them 

29 with iamworthit ads. Ad blocking is commonly available, and the techniques used are well 

30 known in the art. The HTML source for the ad banners may be either called-up from the 

3 1 remote server (such as with an ad network) or alternatively from an ad server (typically 
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1 purchased by the vendor) on the vendor's own host server. In either case, the ad blocking 

2 software typically recognize the HTML source which originates from the ad server. A 

3 directory of the HTML source for the various ad servers is maintained with the ad blocking 

4 software (and presumably updated e.g. if new servers are added or change their IP 

5 addresses). The location/physical dimensions of the portion of the page occupied by the 

6 HTML source which the software removes (its "footprint"), may be readily replaced by 

7 another replacement banner (by iamworthit) from wherever the proxy server resides 

8 (typically on the client but potentially on the network). As a result of a potential difficulty 

9 in which the ad server deliberately changes its IP address to avoid recognition, which occurs 

10 constantly and dynamically, it may be possible to recognize portions of the page which 

1 1 contain image "features" of ad banners, which are inserted from another HTML source 

12 where that HTML source is linked to known vendor sites (which are also linked similarly 

13 from other site's ad servers and/or are known via their HTML sources to have previously 

1 4 utilized ad server technology and a "new" HTML source appears instead, etc. 

15 This model may be extended to other media domains e.g. replacing digital TV 

1 6 commercials instead with targeted ads delivered and precached via iamworthit for insertion 

17 at appropriate times i.e. during commercial breaks during standard video programming. 

1 8 Another variation could be tailored to pre-loading iamworthit advertising to automobiles for 

19 insertion in place of traditional radio commercials. In accordance with the parent patent 

20 application, (and as suggested above), email may be a useful targeted ad delivery medium as 

21 well Per the patent case, the system is able to classify email (according to its source and 

22 content using implicit or explicit actions of the buyer). Based upon certain desirable 

23 confidence threshold settings, the system may automatically delete "span" and replace it 

24 with targeted messages provided by iamworthit which the buyer is paid to receive 

25 pseudonymous physical mail is yet another potential source of targeted mail for which the 

26 buyer may be paid in conjunction with the user's voluntary removal from direct mail 

27 marketing services which iamworthit could provide in conjunction, iamworthit's buyer 

28 compensated targeted advertising model could be extended to a pseudonymous 

29 telemarketing service (as suggested above) using one-time call session pseudonymous 

30 telephone numbers for telemarketers to reach desired buyers. Finally, human sales persons 

3 1 could be the "ad delivery medium" for iamworthit. 
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1 13.1 .3.8 Example: Dynamic personalization of links within a web site 

2 Wide user profile data can also enable the dynamic personalization of links within 

3 a web site, to create a virtual shop floor to match the predicted preferences of each user. 

4 For a new user, that has never before visited the site, it is very advantageous for the site 

5 to already know about the preferences of that user in order to personalize the goods and 

6 services that it offers. The information provided at the secure data interchange, and 

7 gathered from the transactions of a user with another vendor, is vital for this type of 

8 personalization to first-time users. For return customers, an Internet site can also leverage 

9 its is information that it has collected from previous interactions with the user, 

1 0 information that is collected locally to the site. 

11 13.1.3.9 Personalized Search Engines 

1 2 The present state of the art for search systems involves the use of an extremely 

13 static interface, which is not personalized for a user. In the previous patented disclosure 

14 [INSERT U.S. PATENT NO.] we identify features of a user and anticipate areas of 

1 5 content likely to be of interest. A personalized portal interface can then allow a user to 

16 view categories that are presorted in terms of expected value to a user, based around 

1 7 more than the search term just entered. 
18 

19 For example, if AltaVista knows that I am a research student, then the search 

20 engine can bias results in the direction of information that is likely to be appropriate to 

21 my interests, and away from other items. The personalization and filtering can be done 

22 based on results that I have accessed in the past, and also on the basis of information that 

23 is stored about a user in the shared SDI data warehouse. It is possible to greatly increase 

24 the relevance of the search results through eliminating and prioritizing the potential range 

25 of results from the query. 

26 Within the system of SDI, where more information is readily available about a 

27 user, we might now also look at what a user is currently doing, his/her professional 

28 expertise, skills, proficiencies, etc., the reason for the user asking the question, for what 

29 purpose was the question asked, for what purpose will the results be applied (if relevant), 
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1 what is the geographic and/or time period content of the query, what type of content does 

2 a user prefer, e.g. humorous, wisdom/council, scientific, political, business oriented. 

3 In this way we can begin to incorporate contextual content into search. We might 

4 also allow users to contact other users who are determined to be experts in particular 

5 areas browsed, with this information dynamically discovered by queries into the SDI data 

6 warehouse. 

7 Collaborative filtering can help to identify individuals with a close affinity to a 

8 particular site/vendor, and community of interest or interest domain such, so that they 

9 can be informed of opportunities to act as experts and receive referrals on demand. This 

1 0 added ability to bring new content more efficiently and timely could be very valuable. 

1 1 Another simple extension is to provide personalization to the results from a standard 



12 search-engine. For example, we could use a user's profile to perform additional filtering of 

1 3 the results of the search to refine the pages returned on the basis of their profiles and the 

1 4 user's profile, i.e. using collaborative-filtering/clustering style techniques. 

15 13.1.3.10 Smart Browsing: Personalized Annotations, Filtered Information. 



16 We can also introduce personalized annotations to pages that are displayed to a user, 

1 7 with annotations selected to reflect the interests of the user, for example as provided by 

1 8 special-interest groups sharing affiliated profiles, or as provided by other users. A couple of 

19 definitions for this section: exemplar - the profile of target object or (as pertinent to 

20 following description), user profile which is "most like" the profile of the cluster to which it 

2 1 belongs, perhaps a median metric. 

22 The Platform for Privacy Preferences (P3P) [CR 98; RC 99] provides for the ability 



23 to utilize XML meta-tags to annotate Web pages, and within a system such as SDI we can 

24 allow comments from previous visitors to a page to add annotations. Users can receive 

25 annotations from the SDI data warehouse that are associated with annotations provided by 

26 other users with similar profiles. One of the divisional applications of the parent case 

27 "System for Customized Electronic Identification of Desirable Objects" [Herz 98b] relating 

28 to the automatic creation of virtual communities suggests that users may be automatically 

29 assigned to particular communities (e.g. chat groups, forums, etc.) for this purpose. 
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1 We might allow users to rate the annotations in the pages, allow SDI to learn how 

2 useful annotations provided by certain users are to new users, and classify users as "experts" 

3 that receive priority in the position of their current and future comments. Future comments 

4 from users with a poor rating history for a particular content cluster may be deleted. A 

5 persistent interface feature on the tool bar or side bar can provide for annotations to also be 

6 accessed by users selecting certain profile features of users as they browse from page to 

7 page; for example a user could identify the comments of a news article about abortion by 

8 users who are self identified as advocates of the Women's Rights Movement, ultra 

9 conservative senior citizens, teen women or those with a strong interest in alternative 

1 0 medicine or the Catholic Church. 

1 1 The parent case [Herz 98b] further suggests that users may actively provide ratings 

12 in a completely privacy protected manner according to various criteria of pages they browse. 

13 Profile-based clustering of these ratings and annotations can allow a user to submit as a 

14 query a user profile, to receive a page that rates or annotates a page, and a listing of highly 

1 5 relevant pages to a particular type of user. This could also be used in a "reverse engineering" 

16 sense, for example requesting the exemplar user profiles for users that visited and gave high 

1 7 ratings to a particular site, and then using those profiles to find similar sites of interest, or 

18 finding the user features that are in the exemplar profile to understand something about the 

19 content of a web page. The browser interface may automatically display the related links 

20 that are determined dynamically to be most relevant to a user, again statistically estimated 

21 via the data in SDI of users' browsing habits, and/or via active page recommendations or as 

22 book marks by those users as being of particular relevance or similarity to the present page. 

23 Browsing methods may include search-based browsing, and also browsing via a 

24 hierarchical navigation menu system, with users classified according to their behavior 

25 patterns and/or ratings which have been actively submitted. The parent case [Herz 98b] also 

26 suggests the use of hierarchical clustering for products, to help in a smart shopping system. 

27 Similar products can be presented together where the feature criteria for creation of the 

28 hierarchical cluster tree could be price or other criteria. In addition to allowing users to view 

29 item selections according to desired selection criteria, a comparison shopping function can 

30 also allow a user to view the attributes of buyers that tend to buy certain items to help to add 

3 1 confidence to a user that he/she is getting the right product. 
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1 The above description also describes the use of a hierarchical menu through which 

2 groups of users may be identified by their profile features (wherein a profile feature could 

3 even be a rating criteria itself of for example, an opinion via a site survey). These features 

4 could be used to either selectively filter-out content which falls outside of that criteria as the 

5 user navigates the information or identify itfwhen pages encountered where these user rating 

6 features are present, thus displaying this user statistical information in conjunction with the 

7 ratings statistics and/or associated annotations if desired. 

8 We can allow the user to use one or more organizations to provide features and 

9 annotations to help in a search and information interpretation process, to add appropriate 

10 filter and bias to information presentation as a user browses the Web. Similarly, a user might 

1 1 adopt the endorsements of a friend for annotations, and editorial content about particular 

12 pages. 

1 3 The availability of feature information about individuals that provide high ratings 

14 for particular Web pages can also be useful for vendors that wish to find appropriate 

1 5 locations for their advertisements. For example, given an "ideal profile" of a consumer 

16 for a particular product, a web page/type of information can be determined that will be 

17 useful to find appropriate customers. This is an alternative method to providing adverts to 

1 8 particular individuals, instead choosing to provide adverts to limited information domains 

19 that attract appropriate users. Vendors may request additional information, such as site- 

20 specific page view correlations (including time spent viewing each page) in accordance 

21 with user features. Exemplar user profiles and attributes of those users are certainly of 

22 interest to vendors to which those sites belong as well as affiliate sites on which their 

23 advertisements and/or syndicated products are advertised and sold remotely. 

24 We can also profile users with context information, for example based on their 

25 assumed goals (social, business, personal, etc.), information which can be determined by the 

26 nature of their current activities. 

27 In an interesting extension we can allow users to contact other users that are 

28 browsing similar material with similar profiles in real time, allowing ad-hoc virtual work 

29 group formation. Of course, all of this is done within the carefully managed profile and 

30 privacy managed systems of SDI. The location of a user could play into decisions about 

31 physical meetings. Typically some users may wish to enter into a present (or future) 
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1 dialogue, which can be scheduled via calendar agents, which perform automatically 

2 scheduled compatibility meeting/introduction functions. 

3 It is also reasonable to provide the technique of collaborative filtering to identify 

4 users whose profiles are particularly similar to the material which is being viewed, e.g., 

5 those users who are determined to have a high proficiency level with regards to that 

6 particular material. Because the requesting user may be seeking to query the expert or seek 

7 edification on that material, typically same consideration is conveyed in exchange for that 

8 information. In a variation, a more data exchange may be provided if 

9 collaborative filtering identifies fairly similar users to the presently viewed material which 

10 also demonstrate, via their profiles, considerable complementarily in the particular 

1 1 knowledge which they are likely to possess. Vendors may also and 

1 2 facilitate this type of information exchange on their sited between these complementary, (yet 

1 3 metrically similar) customers. 

14 In a consumer- vendor commerce application of this form of interactive browsing we 

15 can allow users to enter into dialogues with vendors based on their profiles, and share 

16 experiences with vendors, regarding their experiences. This can be done in real-time, or 

17 statically within an iamworthit style system architecture, where a user receives financial 

1 8 incentives for providing information that is useful to vendors. 

19 The parent case [Herz 98b] suggests that organizations (e.g. vendors, etc) may be 

20 rated by multiple users across and according to a variety of relevant attributes. One useful 

2 1 extension is to group descriptive opinions of users according to their profiles, certainly users 

22 with a particular type of profile are likely to have affinities towards certain sites and certain 

23 content. It can be useful to example how the assessment of user's varies according on their 

24 profile information. 

25 Another variation is to suggest a three-dimensional interface to allow useful 

26 diagrammatic representations of the information and meta-information that a user is 

27 presented with. The parent case [Herz 98b] describes a method to reduce the 

28 multidimensional characteristics of feature and annotation information to only 3 dimensions, 

29 and present that information graphically, via a technique of principle-component analysis. 

30 The profiles of the target objects associated with a user's preferences are represented as 

3 1 points in 3-D space within a navigational interface. Nodes may also be displayed according 
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1 to category and sub-category. Views may also be restricted to user's with the same goals, 

2 i.e. business users, social users, etc. this mind-set can be inferred by the sites that a user is 

3 visiting, for example. 

4 13.1.3.11 Smart Interface to On-line Discussions. 

5 The present description provides a methodology by which it is possible to apply 

6 techniques of statistical NLP for purposes of enabling users of live discussion groups and 

7 bulletin boards to dynamically visualize concept (via automatically generated labels short 

8 summaries or even visual representations of the concepts). The parent patent [Herz 98b] 

9 describes a browsing implementation in which it is possible for a user to view an on-line 3- 

10 D layout of a "virtual mall" in which the multi-dimensional space of the object profiles of 

1 1 the items are collapsed into three dimensions. Further, the display may be customized to the 

12 user's profile by using as the vantage point the position of the user's profile within the 

1 3 space. 

14 We can use information retrieval, pattern matching, and collaborative filtering 

1 5 techniques to refine information based upon the particular relevance feedback from those 

16 particular individual engaged in a discussion. The novel interface enables the extraction of 

17 text segments from all previous discussions. These (multi-discussion) segments are 

18 essentially grouped into clusters and the clusters are representative of "concepts". The 

19 statistical framework is further used to determine what clusters (concepts) tend to follow in 

20 sequence what other clusters. Because conceptual flow in human language is quite complex 

21 and perhaps somewhat relative to the individual, it is useful to provide different interface 

22 settings, e.g., for example one which is tailored to the individual (i.e., the layout of the 

23 conceptual flow based upon statistics taken from all individuals which are generally similar 

24 to the interest profile of the present user what it would be for certain types of user profiles as 

25 explicitly indicated or what it would be for the collection of user profiles of those 

26 individuals within the present discussion groups (or if two individuals, the user profiles). 

27 The large statistical information stored in SDI may be useful for 

28 the purpose of guiding discussions in a chat or forum context. 

29 Individuals may use the system in order to guide the flow of their own 

30 thoughts (where it is tuned to their own personal profiles) or that of 
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1 what a particular individual with which they are corresponding at that 

2 moment (or in this case ideally both of their profiles) or (to appeal to 

3 the overall discussion forum or chat room) the collection of user 

4 profiles in that discussion forum. Again, in a variation, the system 

5 may be tailored to give additional weight to the particular historical 

6 experiences of the individual or group, thus if implemented in this 

7 way to the individual, the system may act in a similar capacity to a 

8 remembrance agent.13.1.3.12 Example: Vacation Package System 

9 A vacation package organizer decides to begin a large-scale marketing campaign to 

10 target those people who would be the most interested in joining a new Caribbean Cruise. 

1 1 Although the vendor has a database of current customers, it is interested both in increasing 

1 2 the number and suitability of its potential leads. 

1 3 Interfacing with the secure data interchange with which it is a member, the organizer 

14 identifies several possible sources of supplemental data: a LEIA-based travel discussion 

1 5 group, an on-line bookstore, and a Caribbean restaurant. These are found both by browsing 

16 through the interchange's internal list of members, and by using SDI-based data analysis 

17 tools, used within the interchange to automatically identify entities sharing common 

18 characteristics. 

19 The package organizer then contacts each of these entities through the interchange, 

20 and negotiates different data-sharing deals: the travel discussion group is willing to 

21 exchange full information for a large travel discount, the on-line book store is willing to 

22 reveal the pseudonyms of users who have bought travel books in exchange for a per-sale 

23 commission, and the restaurant is willing to sell its entire database for a flat fee (and will 

24 provide an aggregated data set as a sample). 

25 The vacation package organizer now chooses fairly basic data-mining algorithms to 

26 identify the individuals with the greatest potential interest in a Caribbean vacation; however, 

27 the organizer does splurge on a new neural network approach developed by a small software 

28 company. On a per-sale commission, the Software Company is willing to loan the vacation 

29 package organizer use of its data mining code. 
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1 First, the organizer decides which data sets to use. The initial results on the 

2 restaurant's aggregated data aren't so good (its customers turn out to not be very affluent), 

3 so the organizer declines the purchase of the full data set. However, it does agree to the 

4 conditions asked by the travel discussion group and the on-line bookstore. 

5 The data provided by the discussion group and on-line bookstore, being in a 

6 common format, are moved in a secure fashion to the interchange's processing area, and are 

7 acted upon by the data mining tools, which are also in a compatible format. As per the 

8 agreement, the interchange forwards discounted Caribbean cruise offers to the members of 

9 the discussion group, and forwards standard promotions to targeted individuals in the book 

1 0 store's customer list. A few of these individuals respond favorably; these electronic transfers 

1 1 of money and passed back through the interchange, which slices off a commission for the 

12 book store before passing the accepted offers back to the tour organizer, who learns the 

1 3 identities of the customers and can now count them as part of its database. 

1 4 This protocol specification could even be digitally signed by the "owner" of the data 

1 5 as proof of ownership of the data and its associated restrictions by the owner, i.e., effectively 

16 a "digital deed" which is both legal and untamperable by any other party and thus acts as a 

1 7 legally binding proof of ownership and terms/conditions dictating how that data can be used. 

18 13.1.3.13 Example: A Forum for Political Views and Discussion The information 

19 within SDI can represent user's political views, and be used as a very advanced political 

20 polling system that respects the privacy concerns of users. For example, a user can state that 

21 information is released anonymously and that no identifying information is ever released. 

22 That said, within the iamworthit system an individual with a current and complete political 

23 profile could receive a steady income stream from semi-automated focus group queries. 

24 Taken one step further, it could then also be possible to identify for a user candidates 

25 which are believed to be a close match with a user's political profile. Expert opinions could 

26 be used to determine the individual issues and relative weightings (indicating relative 

27 priority which those issues assume within the politicians political priority scheme) as well as 

28 levels of extremism in views to political issues. In the preferred implementation a listing of 

29 different politicians is presented in conjunction with a percentage which reflects the degree 

30 of similarity among the views of the user and those of each politician as listed in descending 

3 1 order or similarity. The user could drill down and observe a redistribution of the percentage 
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1 based on particular desired political issues, or the user could observe the distribution under a 

2 variety of different user profiles. The system may also take aggregate statistics from other, 

3 various types of individuals and or organizations. That is to say each individual can, on a 

4 voluntary basis, disclose as much personal information, be it explicit or implicit disclosure 

5 (form his/her user profile data base), in order to create a more robust and comprehensive set 

6 of statistical relationships. 

7 The system could also ask the users presently to indicate which actual candidates 

8 they selected; thus collaborative filtering recommendations can indicate, for example, which 

9 candidates users with certain political issues-related profiles tend to vote for (even what 

10 other individuals, exactly like the present individuals, tend to do in this regard), or 

1 1 conversely, what political issues voters of certain particular candidates tend to claim to 

12 subscribe to. The system may also identify what characteristics (political or other wise) are 

13 most prevalent or alternatively most unique to individuals which share an affinity for a 

14 particular candidate or a particular issue, or the system may recommend candidates based 

1 5 on existing candidate affinities. 

16 The techniques of the parent patent [Herz 98b] may also allow a synchronous and 

17 asynchronous discussion with other individuals containing certain profiles (or identical 

18 profile characteristics). Again the pseudonymous server may be useful in this regard. The 

19 present system may also enable rating of candidates, in which various types of individuals 

20 (in aggregate) rate the candidate according to different criteria, characterizing the candidates 

21 on issues such as strengths or weaknesses. This is a useful polling example, the ratings may 

22 be revealed according to various types of individuals. 

23 13.1.3.14 Example: A "Virtual" Jury 

24 The judicial process attempts to be as fair and equitable a possible by allowing a 

25 jury of "peers" to be entrusted with the responsibility of evaluating the guilt or innocence 

26 of a particular individual as well as to determine the degree of liability, the penalty and 

27 degree of financial compensation which justified in light of the circumstances. The 

28 information within the system of SDI could be used to store information about users, 

29 explicit and/or implicit information based on their browsing habits, and offer a service, e.g. 

30 in civil cases for jury selection based on characteristics of individuals. In a criminal case the 
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1 'Virtual jury" may be used in addition to a real jury, to provide additional feedback on a 

2 case. The virtual jury may be paid by the plaintiff to participate, e.g., in the case. The 

3 feedback from the virtual jury may be presented to the judge and/or an actual jury before its 

4 own deliberations and verdict. In a novel variation the virtual jury may even hedge bets 

5 upon its conclusions such that if they contradict the conclusions of the actual jury but are 

6 later determined to be correct the members receive compensation. The advent of live 

7 streaming video greatly facilitates the feasibility of the current methodology. 

8 In a variation, it may be possible to apply the techniques of collaborative filtering 

9 towards automating the selection instead of those individuals which are representative of the 

1 0 most diverse possible groups and clusters of individuals a cross a wide range of criteria 

1 1 (including such criteria as psychographics, ethnicity, intelligence, social status, etc.) In fact, 

12 subject to some changes in the law, this approach would be a more efficient means of 

1 3 selecting a standard jury than the process of random selection. 

14 In one variation, the virtual is selected based upon similarity to the 

1 5 plaintiff. In another, it may of individuals which are individually similar to both 

16 plaintiff and dependent on more effectively a combination of individuals who are similar 

1 7 individually to the plaintiff and the defendant respectively. It is believed, in addition to guilt 

1 8 innocence, a more equitable assessment of the monetary compensation which is provided to 

19 the plaintiff by defendant, it such a equal mixture of empathizers from both sides 

20 constitute a virtual jury. 

21 13.1.3.16 Example: Group Therapy 

22 There is a very useful and appropriate application for SDI to the automatic matching 

23 of individuals for purposes of group therapy. Individuals with commonality across many 

24 criteria can be selected, and multiple long-term groups adopted. In an application of this 

25 system, it may be possible to broadly extend the present scheme to the general public (where 

26 privacy is secured via the proxy server) and individuals may identify a group(s) which best 

27 fits their own unique emotional needs. It may also be useful to archive the sessions, index 

28 enabling the sessions and segments thereof to be searchable by keyword, e.g., via speech to 

29 text techniques and/or browsable by topical segment (which may be automatically 

30 segmented and labeled). The present system may also provide an ideal framework by which 
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1 psychologists may identify patients of other psychologists that similar pseudonymous 

2 complements of psychological characteristics and symptoms, to allow targeted clinician 

3 interactions and robust and useful information from therapeutic approaches and/or drug 

4 treatment regimens. 

5 In a variation, the present system methodology also enables a means by which much 

6 more specialized group session topics may be created which focus upon a very specific type 

7 of disorder, conflict or aspect of the patient's psyche. In this way, it may be possible for 

8 "identical" patients to, as group, focus upon different aspects of their ideal therapeutic 

9 regimen which collectively create a comprehensive customized treatment program. It is 

10 worth noting that the specialization possibly associated with particular focused sessions (and 

1 1 even types of individuals) provides a framework by which clinicians can become extremely 

1 2 specialized and expert within certain specific sub-domains of the field. 

13 It is certainly possible to adapt the above described technique for "smart interface" 

14 to provide useful ideas and suggestions for patients engaging in either psychotherapy or 

15 group therapy thus leveraging the information and personal experiences within the 

16 therapeutic processes of the collection of patients precoding them within similar topical 

1 7 sessions which shared very similar psychological and pathological patent profiles. 

18 It is certainly reasonable and appropriate to adopt the present application framework 

19 to more several fields of medicine. For example, enabling physicians to identify other 

20 physicians have had patients with the identical pathological profile and 

21 medical history. This enables them to current medical history and clinical data, 

22 insights, observations, etc., in order for the physician to the present clinical based on 

23 the expert advice and collaborative feedback from the other physicians possessing very 

24 similar experiences. Likewise patients could instantly access the physician or physicians 

25 which have or had experienced the most similar clinical situations that and the present 

26 situation of the patient in order to seek a second opinion. Physicians sharing unique clinical 

27 experiences (particularly clinically or scientifically in or series/problematic could use 

28 the scheme in order to companies, etc.. Finally one could 

29 also apply the 

30 

31 . 
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1 13.1.3.17 Example: A Personalized Educational Portal 

2 Another interesting application is to develop and deploy a personalized on-line 

3 informational portal containing everything from helpful links, illustrative content, text 

4 book information, quiz questions etc. The Personalized Educational Portal (PEP) 

5 typically is designed specifically for a particular class and includes several key features 

6 which are optimally applied as part of a comprehensive intelligent educational system. 

7 These features include the following: 

8 o The ability to dynamically customize all forms of relevant information from the 

9 educational portal. The criteria for this customization however is based not upon 

1 0 the preferences of the user (unless for example the user receives credit for studies or 

1 1 projects or research on topics which s/he may select, rather it is based upon a 

12 predicted profile of the user reflecting his/her strengths and in understanding the 

1 3 relevant content. In particular the techniques of the issued patent number 6,029, 1 95 

1 4 [WHAT IS THIS PATENT NAME] describes a variation of user profiling in which 

15 users are able to achieve a proficiency profile within certain domains of 

16 informational content where these informational domain(s) are determined in 

17 accordance with user's ability to answer a certain question(s) intelligently, discuss 

1 8 the answer to a certain question or about a certain topic or provide a useful reference 

1 9 or URL based upon the level of satisfaction of the requestor. This technique in itself 

20 could be usefully applied within the present application framework. For example, 

2 1 users may be students, and "experts" fielding questions could be other students (like 

22 tutors of sorts) and the payment they receive may be monetary compensation or even 

23 school, credit where other student's satisfaction ratings both qualify them for future 

24 opportunities to submit future responses within that particular knowledge domain 

25 as well as means of verification, and measurements of his/her proficiency over that 

26 particular material. 

27 o Using the techniques of the parent patent application it is possible to also customize 

28 content delivery (including quantity, depth, and difficulty level) which reflects the 

29 user's strengths /weaknesses within the various relevant knowledge domains. It is 

30 possible to even statistically correlate the user's correct /incorrect responses to 

3 1 certain questions or types thereof as correlated with each other by common terms in 
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1 the question answer part concurring in similar textual segments. These questions 

2 may be found in on-line exams or quiz questions associated with the actual content). 

3 It is possible to statistically correlate which content tends to best remedy certain 

4 deficiencies (as determined by incorrect responses to certain questions) by the user 

5 importance in being able to readily correctly answer those types of questions 

6 following reading the pages. 

7 It may also be useful to apply the above technique towards determining which exam 

8 questions or combination thereof most commonly are revealing (if the student answers them 

9 correctly) of a student's proficiency within a particular small yet well definable sub-domain 

10 of knowledge, i.e. in which statistics show that If the student answers that sub-group 

1 1 correctly, s/he will likely answer the others in that sub domain correctly as well. 

1 2 A sub-set of each of these exemplary questions from each knowledge sub-domain 

1 3 may be allocated as questions provided at the end of each relevant section of reading content 

14 (which may be on-line, off-line or only the responses may be entered on-line, exclusively 

1 5 and if a wrong response is submitted, the system may recommend reading certain content 

16 which has (as above suggested) been statistically demonstrated to improve the student's 

17 proficiency in that sub-domain. It may also be possible to gauge in advance the students' 

18 predicted level and rate of advancement by knowledge domain via on-line psychological 

19 testing. Statistical correlations between these test questions and the student's learning 

20 abilities (by knowledge domain) can be usefully developed. 

21 The system could even be constructed hierarchically by graduation of skill levels i.e. 

22 initial mix of questions of varying difficulty levels in order to gauge what level of content to 

23 recommend. The questions which follow are one level higher as is the next set of 

24 recommended content unless improvement is not observed. 

25 It may be possible to a decision tree which automatically select which questions 

26 most effectively test the students, command of the materials (in this way a shorter exam 

27 may actually be better than for longer exams those questions which are more exemplary of 

28 such knowledge may carry greater weight). We could also effectively try to create a 

29 summary e.g. by topics /headings or even using text summarization techniques submit the 

30 areas of deficiency in one student to another student who is particularly proficient in that 

3 1 area for virtual tutorial. For this the tutor can receive monetary and/or also scholastic credit. 
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1 The ideal scenario is actually creating virtual study sessions in which students are matched 

2 together which have the most different complement of proficiencies/deficiencies as possible 

3 and where these areas of potential mutual exchange benefit are maximized and are revealed 

4 to both students initially. Other forms of virtual study groups could be achieved by grouping 

5 students by similar proficiency areas and levels and creating a "virtual tour" with questions 

6 and content presented synchronously to a group of correspondence enabled students. In this 

7 way, if virtual tutoring or study groups occurs on-line exclusively, it may be performed 

8 pseudonymously as desired. We can even take a defined study group and assign it to group 

9 projects which can (again) relate to content to which the group is (collectively) deficient, (as 

10 group oriented problem solving has been proven to be extremely effective in remedying 

1 1 such deficiencies. 

12 In a very novel application, it may be possible to even statistically using historical 

13 data to analyze the relative proficiency profiles of students by knowledge domain (perhaps) 

14 even further by, learning ability in those domains by teacher. Thus a virtual class could be 

15 constructed using live streaming video which are truly customized to each student's 

16 particular needs and abilities, i.e. the idea would be before the semester to identify which 

17 teacher(s) were most successful in eliciting the highest levels of achievement among 

1 8 students which have an identical profile to that of the student (such statistics used in this 

1 9 profile may include but are not limited to the proficiencies / grades of the student in all other 

20 previous classes, psychological testing and/or (selective) knowledge domain proficiency 

21 questions (as above described). Certainly depending upon such investor's personal 

22 preference, it is reasonable for each to agree within their personal data disclosure policies to 

23 disclose to SDI data regarding the types of investments, amounts and under what conditions 

24 (including those conditions stated by other investors such as the example above). The 

25 conditions for this disclosure, however, would be that no individual data be disclosed to 

26 another investor, i.e., that it be used only for SDI to be able to reveal aggregate statistics and 

27 predictions how the investment community, in aggregate, interacts with investment 

28 opportunities, and under what conditions, and in exchange for this disclosure, each investor 

29 earns the right to access this aggregate data. The individual investor or SDI, acting on 

30 behalf of that investor, may then suggest the best terms and conditions with which to 

31 negotiate with other investors, in order to optimally achieve their own desired objectives, 
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1 part of the condition statement to SDI could also include the amount of money to invest in 

2 each synergistic investment opportunity which SDI typically would suggest to the investor 

3 based upon the relative distribution of its presently invested funds, size of the available non- 

4 invested funds as well as the relative risk/benefit proposition of the company to that investor 

5 (as estimated by SDI's use of company data as disclosed to that investor). SDI can even 

6 recommend or act as a proxy, in negotiating with the individual companies (who themselves 

7 could use an SDI negotiating agent representing their own interests). E.g., SDI, acting for 

8 the investor, could leverage considerable information about not only, the present investment 

9 opportunity, but literally all other investment opportunities which had been submitted to SDI 

10 for consideration by SDI affiliated investors. Based upon the total benefit which the 

1 1 investor stands to gain which is based, in turn, upon the SDI recommended comprehensive 

12 investment strategy) and the relative downside (relative risks compared to other alternative 

1 3 prospects within the total pool of investment opportunities) may be used to determine basic 

14 parameters in which the investor or its SDI proxy would be willing to deal, e.g., the amount 

1 5 of funding, the associated amount of equity as well as other more subjective factors such as 

1 6 amount and nature of control, the use of the funds (e.g. will they be used for a particular 

1 7 technology product or service which, if developed, offers the primary degree of benefit to 

18 the investor's existing company(s). It is also worth noting that SDI may identify those 

1 9 prospective investors whose collective cooperation would result in an economically optimal 

20 scenario for that investor. SDI, acting on behalf of the collective group of investors, may 

21 recommend data disclosure terms to each individual investor (or its SDI proxy) which will 

22 optimize the probability that the coalition resulting from the cooperation of all or most of the 

23 recommended participating investors will optimize the economic benefit to that individual 

24 investor. This data could be no more than predicted value (including marginal value) that 

25 particular investor would achieve the desired coalition were to be formed. Based on this 

26 data the investors or their SDI proxies could negotiate with one another in order for each 

27 investor to strive towards its own individuals best interest while attempting to assure that the 

28 global objectives of the coalition are ultimately achieved. E.g., that not only the best 

29 investment opportunities within this optimal global plan are funded but also the least 

30 desirable as well (of course this information would be concealed from the companies at least 

31 until the negotiations are concluded). Accordingly, the relevant investors may wish to 
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1 compete with one another for the best investment opportunities within this global plan or 

2 SDI with their unanimous cooperation again acting in their collective best interests may as a 

3 coalition assign investment opportunities to each investor assuring them each equitable 

4 treatment and, of course,... no competitive bidding for a given investment opportunity 

5 (which, of course, would result in ultimately more favorable terms to the collective group. 

6 In this preferred application (of non-competitive bidding) the SDI agent for the 

7 collective group achieves a commitment from each prospective investor in the group that 

8 within a range of prospective terms, each investor would be willing to commit its 

9 participation in the proposed investment opportunity. Accordingly, each company 

10 (assuming there are more than one) also agrees to a predetermined range of terms with 

1 1 which it would be willing to accept from the collective group of investors. Both sets of 

12 information (typically recommended by the SDI representation each associated party) are 

1 3 disclosed to the main (party neutral) SDI entity. If overlap exists in the range of terms or the 

14 ranges are close to one another, the introduction and negotiation between the parties 

1 5 becomes initiated (through the company(s) are not made aware of whether the overlap exists 

16 or is merely "within the general proximity" of accept 

1 7 introductions on the interest of the parties (though most 

1 8 markedly the companies) 

19 13.1.3.18 Example: A "Group-Think" Ideas Market 

20 In the consumer driven market place there is a significant unrecognized 

21 opportunity in being able to harness the intellectual capital of the consumer market. 

22 Historically, (in a non-networked environment) leveraging of such knowledge has been 

23 impractical and virtually infeasible due to the inability to identify and measure this 

24 resource of human skills (and thus match this skill with in the appropriate matching 

25 problem)with the particular types of problems which companies may be in need of new 

26 ideas and solutions as well as the lack of a trusted intermediary which can validate on an 

27 impartial including basis the authentic contributions of the provider of the information (as 

28 not having been claimed by the recipient after the fact as having been previously 

29 conceived before). Of course, there are remaining obstacles such as the fact that 
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1 important problems are sometimes maintained confidential within a company (not even 

2 disclosed to some its own internal staff). 

3 With the emerging of extranets a certain amount of inter-organizational 

4 knowledge leveraging is becoming more readily achievable. A secondary advantage over 

5 an extranet is because of its ability to act as a trusted intermediary. It can validate the 

6 fact that that the individual is only dealing with one commercial entity and not its 

7 competitors (at least formally). Thus unlike extranet based knowledge leveraging it is 

8 conceivable a trust relationship enabling a certain amount of privileged information 

9 disclosure is at least theoretically possible. 

10 In the preferred embodiment, SDI retains a record as to the particular 

1 1 problem/solution sets that were successfully or unsuccessfully provided by users in the 

12 past in order to improve predictions as to their abilities to successfully address the same 

13 types of problems/needs in the future as well as to more accurately determine the 

14 "similarity" of different and (perhaps often apparently) disparate problems from the 

1 5 standpoint of the user's ability to solve problems based upon knowledge of their ability to 

16 successfully (or unsuccessfully) solve others SDI is entrusted with the knowledge of 

17 particular needs of organizations (it may, for example, cooperate with Think Task 

1 8 companies). These needs are commonly oriented ideas (though not exclusively so), for 

19 example, suggestions or ideas for product development, marketing, a business model or 

20 concept (such as a new business direction), content creation, technology or even potential 

21 commercial customers or strategic partners. In lieu of identifying particular problems to 

22 which an organization may be desiring a solution, users may instead provide their own 

23 ideas and/or solutions to needs or problems which they perceive SDI vis-a-vie its in- 

24 depth knowledge of the internal needs and problems of numerous entities is thus able to 

25 determine the relevance of the suggested solutions or ideas or alternatively identify a 

26 particular organization which is likely to possess matching need which is relevant. 

27 13.1 .3.19 Experimental Validation of New Models: A Test-Market 

28 The system of Secure Data Interchange can be used for a test market application, 

29 where the effect of a vendor's new collaborative- filtering based targeting model, or new 
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1 pricing model, or advertising campaign, can be tested by simulation and "hold-out" 

2 testing. 

3 A small sample of users, can be targeted, and the results measured and reported to a 

4 vendor, before a vendor commits to an extensive campaign. This may provide a cheap 

5 method to test market conditions for a product, and run tests on sample groups selected on 

6 the basis of similarity to other products for which buyer response is known. Random 

7 sampling of buyers may be initially performed using data mining as applied for marketing 

8 research. Based upon this initial response vendors may identify key features of respondents 

9 which are typical buyers, non-buyers, proponents, non-proponents, etc. Accordingly, rapid 

1 0 profiling (as similarly) may enable the automated or semi-automated selection of disclosed 

11 segments of users (containing particular attributes) which are likely to have relevant 

12 properties. The user selection process may be iterative based upon responses from users as 

1 3 they are selected. 

14 We can demonstrate this value experimentally, for example we can offer a vendor a 

1 5 free-trial and present personalized information/advertisements to one group of SDI users 

16 (both on the site and if desired also across the network), and regular advertisements etc. to 

1 7 another group without the aid of SDL The increase in vendor revenue can be estimated 

1 8 from client-level monitoring of the change in purchase volume achieved with well-focused 

19 solicitations on the vendor's own business. Other prospective ads and syndicated 

20 purchasables could likewise be virtually overlaid on the vendor's site. Thus degree of 

21 increased click through rates, transaction rates and syndication revenues could be tabulated 

22 for that vendor in advance of his/her subscribing (or even being solicited) to SDI. Portals 

23 are also potential beneficiaries of this service in light of their need to better target their 

24 partner's ads. 

25 The ability to the tailor a targeted marketing strategy to users knowing these 

26 correlations could be extremely beneficial in both on-line as well as off-line commercial 

27 environments. In the process of selecting the target user profiles significant value may be 

28 achieved from data collected from that vendor's own on-line presence, and a portion can be 

29 sold to other similar vendors; as discussed in the main description of SDI, this now forms a 

30 cached query which can be used by other vendors. 
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1 We might also demonstrate value to vendors with SDI by monitoring the 

2 performance of vendors with SDI technology, and providing metrics for new vendors to 

3 allow them to select suitable models of user targeting and personalization. Vendors that 

4 subscribe to SDI (the vendor centric version) can provide more attractive offers/products 

5 to users, based on information about the wider activities/interests of a user, on other 

6 vendor pages, and in the physical world (of course, only to the extent that this 

7 information is authorized by the user). Vendors can use information in the central SDI 

8 data warehouse for users' cross vendor and within vendor browsing and purchasing 

9 habits, and also with respect to profiling information about a user. 

10 One key application is first-time personalization, so that information and products 

1 1 can be targeted to a user when he/she first hits a web page, based on profile information 

12 that the user is willing to release. Vendors and users may also sell this information to 

13 other vendors. For example, information that a user likes a particular type of music is 

14 very valuable for vendors that sell content-based products, for example books, and CDs. 

15 Vendors can personalize their service, for example with collaborative-filtering based 

16 recommender systems. 

17 13.1.3.20 Example: An Efficient Product Delivery System 

18 The parent issued patent [FILL IN THE PATENT NO.] describes an application of 

19 collaborative filtering to the strategic optimization of a vendor s business, for example to 

20 allow a vendor to select an optimal location for an inventory warehouse based on projected 

21 consumer purchasing patterns; using the aggregate purchase history of users at that site 

22 compared with the other purchase selections at other sites for similar users. The model can 

23 also be used to predict demand for new items, and optimal locations for inventory given 



24 warehouse locations. 

25 Now, within SDI the vendor can have control over two things: 

26 1 ) the vendor has access to profile information about users 

27 2) thevendor has a method to personalize and annotate on a dynamic basis the 

28 presentation of products to a user. 

29 In common with the smart caching application of SDI to making communication 

30 networks more efficient, we can suggest an application to a product delivery system. At any 



Page 184 of 3 70 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 moment a seller can have a good idea of the products that are available, and even the 

2 products that are in trucks etc., close to a prospective customer. This information allows the 

3 vendor to selectively present items that are close to the user, with labels such as "this is 

4 within 30 minutes of your door, NOW". Messages like this allow a vendor to manage its 

5 delivery process, because predictions can be made about likely purchase patterns, and then 

6 stock that is brought close to user locations can be explicitly advertised to those users. 

7 Essentially, we attempt to predict transaction volume on a per item basis, and then 

8 positioning geographically physical inventory storage facility locations where inventory can 

9 be stocked so that items which a user is predicted to request are already located within the 

1 0 immediate physical proximity of that user. 

11 As the trend to disintermediation continues (whereby we have informational 

12 middlemen, but not middlemen that physically hold products) we can allow vendors to 

13 notify portal intermediaries of product location within SDI, so that the portals can then pass 

1 4 this information onto customers. 

15 The key role of the SDI system in this application is to allow users to receive 

16 personalized recommendations of products, and also product locations, based on their 

1 7 profile information — but without the vendors receiving that profile information. In addition, 

1 8 the shared information about user profiles in the data warehouse allows vendors to build 

19 good models for inventory location. Finally, SDI can act as a trusted intermediary between 

20 different competing vendors: for example suppose Amazon users bookstore A and 

21 bookstore B to provide books, and A and B do not wish to tell each other where their 

22 products are. The stores tell SDI where the books are, and SDI presents that information to 

23 users on a per-book basis, as books as browsed within the purchasing system. 

24 There are numerous useful applications to improving quality, speed and cost of 

25 delivery to a user. In one example, it may be possible to provide same day delivery for on- 

26 line purchases. We could also restock a truck on the basis of what a local population of 

27 users are likely to purchase; with items presented to a user along with a particular 

28 anticipated delivery period, with the Global Positioning System (GPS) or more LEI A 

29 [INSERT THE PATENT NO.] on the vehicle providing up-to-date information about a 

30 truck's location and anticipated delivery time. 
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1 We might also suggest that a user can physically travel to some location close to 

2 his/her base, based on information within SDI about the user's travel patterns, etc. 

3 Purchasable items can then be located in locations that are convenient for users, or shipped 

4 dynamically to those locations (i.e. intermediate warehousing systems). Users can be 

5 provided with real-time directions to the location of such a pick-up point via personal digital 

6 assistants (PDA) and navigational devices, etc. 

7 1 3. 1 .4 Push-based Advertising/Solicitation 

8 Another form of information that a user can submit to the central SDI database via 

9 his/her client is the advertising acceptance functions, which state the explicit preferences of 

1 0 a user for adverts, and required payments. The level of compensation that a user requires to 

1 1 receive an advertisement will depend on the relevance of an advert. 

1 2 An application of SDI is that vendors can use the centrally stored information to 

13 request that as a result of a query the query-execution module in the central SDI data 

14 warehouse sends advertising solicitations to appropriate users, i.e. users that are willing to 

1 5 receive adverts as specified within their acceptance functions, and willing to release profile 

1 6 information to enable useful advertising as specified in their price rules for regular data. 

17 Figure 15 illustrates the system for push-based advertising. A vendor requests the 

1 8 right to advertise to users by sending a special type of query to the central SDI data base, 

19 where the query states the vendors preferences for user profiles and requests that adverts be 

20 sent to users. The vendor never receives the contact information for user's directly. The 

21 server also receives payments from vendors and credits users' accounts. 

22 Suppose Vendor V wants to advertise a new product to relevant users. Vendor V can 



23 use the SDI database to select users with a high predicted hit rate for the advert, and then 

24 compute the value of a bid that it is prepared to pay users for the right to provide them with 

25 its advert. The vendor makes a request for bids to appropriate users by formulating this as a 

26 query with an associated action, and submits to the central SDI data warehouse where it is 

27 handled by the query-execution module. Part of the query is a bid function, indicating the 

28 maximum amount that a vendor will pay to display an advert. 

29 The query-execution module evaluates the request with respect to the profiles of 

30 each user, the rules that agents submit for allowing access to profile information, and the 
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1 advert acceptance functions. If successful the vendor pays the minimum value necessary to 

2 have its ad accepted by the user, again simulating a Vickrey auction as for the dynamic 

3 competition for banner ads described earlier. 

4 For example, suppose a vendor has an advert Ad to push to users, and is willing to 



5 pay up to $1 to place the advert with users that have a predicted hit rate of greater than 30% 

6 on the advert. The vendor can formulate a query that will first identify users with a predicted 

7 hit rate that is greater than 30%, and then determine which users will accept the adverts at 

8 the bid price. If successful, the adverts are pushed to the users via the SDI central data 

9 warehouse, and the payment required by each user is submitted, less than $1 in all cases. In 

10 a simple variation, the vendor might also state a fixed budget, so that it does not send 

1 1 advertisements to more users than it can afford. 

12 We can use the same profile information that provides focused/personalized service 

1 3 to users that hit a site that they have not visited before to form well-targeted mailing lists for 

1 4 vendors. The Secure Data Interchange can form mailing lists in a number of different ways. 

1 5 First, consider a vendor that wishes to send targeted mail to some of its own user-base. 

16 When users connect to a site they indicate whether or not they are willing to receive 

17 electronic mail, and provide a "mail certificate" to a vendor if they are happy to receive 

1 8 mail. The Secure Data Interchange can proceed as follows: 



19 (a) Perform analysis for the vendor to determine an appropriate set of users to receive 

20 the solicitation, based on the information that the vendor provides about what it 

21 intends to market, and provide the list of pseudonyms to the vendor for mailing; 

22 (b) Perform the same analysis, but also forward the communication to the users 

23 directly. 
24 

25 Now, consider a vendor that wishes to target new users, represented with different 



26 pseudonyms. Users indicate whether the information that a vendor submits about his/her 

27 transactions may be used for solicitations, and furthermore vendors indicate the set of 

28 business interests that can receive the benefit of information that is submitted to the central 

29 SDI server. The SDI server can continue by performing analysis on the relevant subset of 

30 the permitted class of data records that pertains to the product or service that the vendor 

3 1 wishes to model, and generate a list of appropriate pseudonyms. Finally, the SDI server can 
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1 sell the pseudonyms to the vendor outright, together with a certificate that the vendor can 

2 send mail to the pseudonyms, or the SDI server can retain control by sending the mail on 

3 behalf of the vendor. 

4 Provide vendors with virtual mailing lists that can be mailed to via the proxy server 

5 only. i.e. these customers should be solicited based on our analysis, (could even give 

6 summary info., without revealing details about users). Furthermore, do not even reveal data 

7 that corresponds to a pseudonym to a vendor because the vendor then has that information 

8 about me when I am on his site. 

9 A central data warehouse also enables vendors to identify new potential customers. 

1 0 This process is broken down into a number of steps: 

11 (i) The vendor assesses the value of the information present in the secure data 

12 interchange. This computation is performed securely either by revealing randomized 

1 3 aggregates to the vendor to enable its own local analysis, or by allowing the vendor to 

14 check data and algorithms into the secure data interchange site for analysis. 

1 5 (ii) The vendor selects criteria for mailing unsolicited advertisements, and agrees on a 

16 pricing model. In this case per-impression pricing is the most obvious pricing model, 

17 as it is difficult to monitor when a user responds to unsolicited mail per-transaction 

18 pricing is difficult. The user could be motivated to do this should the Secure Data 

19 Interchange promise future returns for recording a successful solicitation with the 

20 database. 

21 (iii)Either the data list is released to the vendor for its use, if this is within the selling 

22 vendor's data policy, or the data interchange sends mailings on behalf of the 

23 purchasing vendor. 

24 1 3.1 .5 Community Dollars and Business Models 

25 The primary objective of the iamworthit model is to create a market for 

26 information about buyers, with agents that submit profile and other personal information 

27 to the database able to collect payment in return for queries executed by vendors. In an 

28 important extension, we can allow this payment to be made in terms of community 

29 dollars, which can only be spent with particular vendors. Community dollars allow a user 

30 that provides profile information to the system of SDI to receive payments that are 
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1 dedicated for a particular type of purchase, these payments are called "community 

2 dollars". One central example is to allow a vendor that signs a user into the SDI system to 

3 be able to lock a proportion of payments accrued by that user to the vendor's own 

4 product/service domain. This is important, because vendors now have incentives to bring 

5 new users into the system. A vendor can offer a user community dollars on its (and its 

6 affiliates) web site, in exchange for receiving user data via the SDI data exchange. 

7 We allow vendors to pay in "community dollars" for adverts, dollars that can only 

8 be spent at that vendor (with the host site of the advert receiving a share of the profits). This 

9 provides vendors with the ability to gain long-term customers. Furthermore, so long as the 

10 buyer agrees to receive advertising from his/her iamworthit subscription offer, community 

1 1 dollars can be replenished at the rate at which advertisers are willing to pay for impressions. 

1 2 This provides buyers with an incentive to spend at the vendor's site, because the vendor can 

13 monitor (pseudonymously) the buyer's that are sensitive to discounts and other special 

14 offers (that are delivered as community dollars). With community dollars a vendor can 

15 compensate buyers for information that they access, but tie that information to certain 

16 vendors (e.g. the vendors that first signed the user into the system). The system has the 

1 7 following useful properties: 

18 (1) buyers are incentivized to provide information that allows vendors to push relevant 

1 9 advertisements/products; 

20 (2) buyers will also be more likely to make purchases at a site or a coalition of sites for 

2 1 which they can receive discounts via community dollars; 

22 (3) providing buyers with community dollars will increase the number of hits to a site. 

23 We allow buyers to receive compensation for providing personal data to vendors, 

24 information that has value to vendors because it allows information to be focused (for 

25 example relevant ads can be displayed to a buyer, based on his/her profile). The system of 

26 iamworthit credits buyers for information, and provides buyers with direct incentives to 

27 reveal profile information to vendors. 

28 A vendor can sign up with iamworthit.com and agree to provide only the most 

29 restrictive type of community dollars, that can be spent at that vendors site. Community 

30 dollars are the currency that vendors provide in return for the right to provide focused 

3 1 information to buyers. Dollars can be general (e.g. for a network of vendors), or very tightly 
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1 focused (e.g. for a particular product, at a particular time). The buyer-centric infomediary 

2 acts as a broker, matching buyers and vendors. Another key role of the infomediary (e.g. the 

3 portal) is to protect the buyer from information saturation by controlling the flow of 

4 solicitations, (i.e. restrict the number of ads. that a buyer sees) 

5 Community dollars extend the methods in Secure Data Interchange that allow a user 

6 agent to sell controlled access to information to other agents, introducing in addition a 

7 method which can control the ability of a user to spend received payments. For example, a 

8 vendor might be happy to pay a user for the right to display a targeted advertisement if the 

9 payment represents a discount that can only be used against products offered by that vendor. 

10 Community dollars provide this functionality, allowing dollar payments to be limited to 

1 1 reimbursement in particular "communities" of vendors and/or other agents. 

12 Community dollars instead are more configurable than traditional currencies, which 

13 by their very nature are transferable to any agent. In particular community dollars can be 

14 restricted to spending at certain vendors, can also have non-linear values to encourage 

1 5 loyalty from buyers. 

16 The novelty in our description of community dollars in comparison to current 



1 7 methods known in the art for electronic cash is that we allow e-cash to be targeted so that it 

18 can be spent in particular ways, to be smart so that it can accrue value in non-linear ways, 

1 9 and to be transferable only between the pseudonyms that belong to the same agent. 

20 13.1.5.1 Simple Business Models 



21 A vendor that allows community dollars to be spent does not need to implement a 

22 special community dollars/discounts program. The buyer can also be issued a special debit 

23 account dedicated to community dollars, that permits pseudonymous transactions without 

24 revealing a buyer's portfolio of pseudonyms. 

25 A portal site that hosts advertisers and buyers that subscribe to iamworthit can 

26 mandate that all community dollars are to be spent at sites that advertise on the portal site, 

27 and also only when the sites are accessed via the portal site. This technique will increase 

28 portal traffic. Portals can be expected to compete in terms of: (a) the fraction of advertising 

29 revenue that is turned over to buyers, in return for receiving profile information from buyers; 

30 (b) the level of advertising that buyers are exposed to; © the nature of the community dollars 
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1 "package", i.e. what vendors can the dollars be used at etc. This can be useful to attract 

2 niche customers, that have common outlooks, interests, and business needs. The primary 

3 goal of the portal is to drive traffic through the portal. 

4 The main mode of the community dollars advertising model allows vendors to 

5 advertise for free, but provide community dollars to buyers, that can be spent at some later 

6 time. The cost of advertising can be linked to the success of advertising. Moreover, the 

7 vendor can direct offers and adverts to particular user profiles. The hosting web page 

8 receives a share of the vendor's revenue that comes from transactions involving community 

9 dollars. The dollars can represent "stored value", such as bonus points, that can be applied to 

10 special discounts for offers which are delivered via digital coupons and/or as "straight 

1 1 value" which could be converted directly to purchases thus are equivalent to real dollars at 

1 2 the point of transaction. 

1 3 The community dollars can be "credits" that can be redeemed as real cash, credits 

14 towards discounts, and can be spent across a suite of sites, or limited to one site. The co- 

1 5 pending patent application entitled "System for the Automatic Determination of Customized 

16 Prices and Promotions" [U.S. Patent App. 10/262,123] describes a comprehensive scheme 

17 which may be implemented in either on-line or off-line commerce environments. The 

1 8 system enables vendors to deliver a digital message in the form of a promise to a buyer 

1 9 (typically on encrypted form for purposes of targeting a buyer specifically). This promise is 

20 typically a discount for a product, set of products (or all products in stock) or may even 

21 include entitlement to special privileges for that buyer, thus it is termed a "digital coupon". 

22 The community dollars can represent special discounts for a buyer. 

23 The buyer receives a financial incentive for receiving well-targeted solicitations, 

24 while preserving buyer privacy within the SDI system. The vendors support the community 

25 dollars through advertising revenues and increased sales volume. We can also provide the 

26 vendor through which the buyer first subscribes a special "first screen" right that allows the 

27 vendor to provide a buyer with his/her first impression as soon as s/he logs on. 

28 Of course many variations of the community dollars scheme are conceivable and the 

29 current description is in no way intended to limit the scope of the claimed invention. For 

30 example, the ad revenues generated may instead be apportioned between direct payment 

3 1 which the buyer receives, in real cash, community dollars which the vendor (or ISP) credits 
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1 the buyer and/or direct payment which the vendor (or ISP) receives as well as iamworthit or 

2 any combination of the above such as exclusively direct payment which is apportioned 

3 between the buyer and the vendor (in the absence of the community dollars scheme). 

4 In one variation all community dollars collected by a buyer must be spent back at the 

5 vendor site at which they originally subscribed (and also the site that hosts the adverts of 

6 other vendors). A buyer can spend the dollars with any vendors that are site partners of the 

7 original site. This provides the vendor an incentive to accept and promote the community 

8 dollars concept. 

9 The value of providing a buyer with targeted solicitations is estimated at 

10 approximately $300 to $500 per year (based upon $120 per 1000 targeted impressions at 

1 1 approximately 25 impressions per day). Given these significant benefits, a vendor can 

1 2 provide a buyer with a significant discount (in the form of community dollars). Vendors 

13 benefit from increased sales volumes. When the price of items is less than the value of 

14 dollars, the vendor can limit the amount of discount that is available on any single product, 

15 or only allow community dollars to be applied towards customer discounts (which may 

1 6 nevertheless be quite substantial). 

17 13.1.5.2 Implementation Details 

1 8 We can use meta-tags to restrict the way that community dollars can be spent. The 

19 tag associates the dollar, but the dollar is released within the system of blinded signatures 

20 [Chaum 85; Chaum 92] so that a buyer that collects dollars over many transactions with 

21 different vendors can spend the dollars without compromising his/her private information 

22 about pseudonyms. Dollars can be restricted to a number of vendors, and also restricted in 

23 additional ways — i.e. they can only be sent if the buyer visits the site through a particular 

24 portal, cannot be redeemed at a competitor, are worth a bonus if redeemed with certain 

25 vendors, etc. 

26 The technology used to implement community dollars is based on cryptographic 

27 techniques known in the art, for example within a digital signature scheme for electronic- 

28 cash as described in the work of Chaum [Chaum 85; Chaum 92]. The core functionality 

29 which must be supported by any system for community dollars within SDI is: 
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1 a) An agent can receive community dollars under multiple pseudonyms but spend 

2 the dollars under the same pseudonym. The dollars must not allow an adversary to 

3 link the pseudonyms and break a user's identity. 

4 b) Community dollars can be restricted to be spent with certain vendors and for 

5 particular products/services. 

6 c) Community dollars can accrue (and lose) value in non-linear ways. 
7 

8 The function of (b) can be supported within a system of electronic coupons, based 

9 on cryptography, as disclosed by Tygar et. al. [Tygar 98]. Alternatively, we can extend 

10 Chaum's [Chaum 85; Chaum 92] technique of blinded signatures to allow dollars to be 

1 1 signed with the private key of a certifying agency, where the key is the private pair of a 

1 2 public key which indicates the ways in which the dollars can be spent. 

1 3 The use of blinded signatures ensures that at the point of validation of cash, where a 



14 stream of random bits is signed by the "bank" to validate its value and restrict its use, the 

1 5 bank is blinded as to the identity of the bits, so that no-one other than the agent that creates 

16 the bit stream can trace the cash. This allows community dollars to be transferred between 

1 7 multiple pseudonyms of the same agent without being used to break an agent's identity. A 

18 vendor will accept community dollars as part-payment so long as the digital signatures 

19 indicate that the dollars are valid for the purchase. An extension to the scheme presents 

20 duplication, with real-time verification of tokens. The vendor can finally exchange the 



21 dollars for "regular" cash via an electronic bank that accepts community dollars (debiting an 

22 associated account). 

23 We suggest an electronic wallet application to allow users to store tokens locally, 

24 and then transfer them securely between pseudonyms. The electronic wallet can be 

25 implemented in a client-side SDI proxy, or within the central SDI data warehouse, within 

26 the eBank module. The invention draws on techniques described in D. Chaum [Chaum 88; 

27 Chaum 92], where the use of a secure chip in "smart card" technology prevents the 

28 tampering and duplication of e-cash. 

29 The solution proposed is that all e-cash must be signed not only with the private key 

30 of a vendor, but also with the private key that corresponds to a particular pseudonym within 

3 1 SDI. This is illustrated in Figure 1 6. 
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1 Effectively cash becomes a one-to-one mechanism for payment, tied to future 

2 transactions between a user under a particular pseudonym and a particular vendor, for a 

3 particular purpose. The eBank allows a user to transfer cash across pseudonyms via the 

4 client-side SDI proxy which maintains the user's profile identifiers. 

5 We support a dollar transfer command, of the form TRANSFER (PIDl, PID 2, 

6 amount) where PID_1 and PID_2 are the public keys to identify two pseudonyms owned by 

7 the same agent. Before executing such a transaction the eBank requires that the requestor 

8 answers challenges for both identities, for example signing random messages with the 

9 corresponding private keys. This proves that the requesting agent has authority for both 

10 pseudonyms. Finally, we need to be sure that the pseudonyms belong to the same subscriber 

1 1 to SDI, which is possible via the one-way identity look-up functionality of the central SDI 

12 data warehouse. The eBank can query the look-up table, and check that both PIDs 

1 3 correspond to the same UUID. 

14 For example, an amount of dollars below a certain threshold can be programmed to 

1 5 have no value, but additional dollars can have increasing marginal value. This provide a 

16 non-linear incentive for a user to stay with the same vendor in a market with low switching 

1 7 costs, or a non-linear incentive for a user to receive adverts from the same advertising agent. 

1 8 The advantages to vendors and advertisers are repeat purchases, consumer lock-in, and also 

1 9 improved profiling from interacting with the same agent over extended periods of time. 

20 The functionality is embedded into the community dollars. The framework expands 

21 the idea presented in Chaum [Chaum 85; Chaum 92] where the public key of a public key 

22 /private key pair indicates the dollar value of a community dollar. In this case, we allow any 

23 number of public-key/private-key pairs, and use the public key as an index into a table 

24 maintained within the eBank to provide additional criteria about how the cash can be 

25 redeemed, for example it has more value if used in combination with other community 

26 dollars, it decays over time from data YY/YY/YY, etc... We do not intend to limit the 

27 variations possible, but provide this as a framework for ways in which constraints and 

28 conditions on community dollars can be extended. For example, community dollars can also 

29 be programmed to lose value over time. This can increase the number of repeat purchases 

30 In the preferred implementation we use an "electronic cash" infrastructure for the 

31 community dollar system. A buyer's SDI-enabled client-level proxy stores dollars that the 
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1 buyer receives securely. Dollars are anonymous and non-traceable, so that the buyer can 

2 maintain a single "bank" of dollars, and aggregate dollars collected across pseudonyms for a 

3 single purchase, so long as the purchase satisfies the constraints on the dollars. Each dollar is 

4 created using Chaum's blinded signature technique, and also signed with the conditions on 

5 its use. 

6 This scheme allows vendors to monitor the offers that buyers respond to, because 

7 when a buyer presents a community dollar — the dollar can be validated to indicate the type 

8 of discount that it is, even if the identity of the dollar (i.e. the serial number) is untraceable. 

9 SDI provides vendors with guarantees that buyers have once-in-a-lifetime pseudonyms, so 

1 0 redeeming a voucher of a particular type that is redeemable only at vendor V and was issued 

1 1 by vendor V allows vendor V to be sure that the voucher was issued under the same 

1 2 pseudonym, and has not been transferred to another of the buyer's pseudonyms. 

13 In an alternative architecture, the ISP-level SDI proxy, or the web-host for the 

14 advertising service, can maintain community dollar "debit" accounts for each buyer. This is 

15 more limited, because it does not allow buyers to transfer dollars between pseudonyms 

16 without compromising privacy (revealing a portfolio of pseudonyms). However, in a 

17 scheme where advertisers require that agents have once-in-a-lifetime pseudonyms, and only 

1 8 release community dollars to be redeemed at their own site, this is not limiting. Both of 

1 9 these approaches are useful for "community dollar-enabling" numerous or all sites. 

20 13.1.5.3 Example Community Dollar Applications 

21 © On-line Gaming site. 

22 Consider an on-line gaming site that has a network of affiliated vendors, that do not 

23 pay to advertise, but provide community dollars that can be spent either at the 

24 vendors — or at the casino. When buyers lose money at the casino the casino receives 

25 real dollars from vendors. The casino is one possible outlet for spending dollars — and 

26 a vendor only pays the casino (the host of its ads) if a buyer chooses to gamble on the 

27 site, and loses its dollars. The gaming site becomes a portal, with links to partner 

28 vendors. Each vendor offers the buyer community dollars, that can only be spent 

29 back at that site or at the casino. However, the number of dollars which can be spent 

30 at the store is substantially less than the number of community dollars which can be 
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1 spent at the casino. IF the buyer accepts the credits, whenever the buyer accesses the 

2 URL to the store he/she is either automatically routed first to the casino portal or to 

3 the vendor site whereby a prominent banner is displayed which is displayed to that 

4 particular buyer from which the buyer can conveniently engage in a casino gaming 

5 session. 
6 

7 If the buyer loses a substantial amount of community dollars he/she may regain the 

8 lost credits by spending a specified amount (in real dollars) at a partner vendor. This 

9 provides a safety-net for buyers. The cost to the vendor is the cost of the dollars that 

1 0 the buyer lost at the casino, and the cost of replenishing the buyer's community 

1 1 dollars (which can be used for further gambling). However, the vendor makes a 

12 sale — so the vendor is happy so long as the dollar value is a reasonable discount for 

1 3 the sale. Clearly, the casino gains substantially through the redemption of these 

14 community dollars. The vendors can make an agreement with the casino where they 

1 5 only compensate a fraction of community dollars. A percentage of a buyer's profits 

16 at a casino are paid in community dollars, another percentage can be paid in real 

17 dollars.. Perhaps the casino can also provide vendors with a revenue share. 
18 

1 9 We allow for community dollars that are restricted to particular products, and 

20 customized for an individual buyer. The dollar-object can contain two parts: the first 

21 part is readable to the buyer, and indicates the nature and the amount of the discounts 

22 to which the credits can be applied. The second part of the message is encrypted, and 

23 accessible only to the vendor, and is signed by the vendor to prevent any form of 

24 alteration. The information can contain the dollar credit to the buyer, the terms and 

25 conditions of the community dollars, a dollar amount, the pseudonym ID of the buyer, 

26 an expiration date, the terms and conditions of discounts and special offers to which 

27 community dollars may be applied in combination with a partial cash transaction. 
28 

29 The vendor must check that it has not previously redeemed any piece of community 

30 currency with the same identifier; the identity of the buyer is correct; the date; the 



Page 196 of 3 70 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 terms and conditions. Some vendors may allow redemption of the community dollars 

2 at other vendors' sites. 
3 

4 o Web-hosting Application. 

5 One of the commercial applications of which community dollars may be used is in 

6 offering web sites free web development services and/or free web hosting services. In 

7 the former case a Web development tool provider could share in the community dollars 

8 revenue stream from iamworthit ad revenues, and accordingly cover the costs of the tool 

9 suite. Whenever new upgrades including new development modules are added, the 

10 costs could be covered by an incremental increase in the percentage of advertiser 

1 1 transactions. There is considerable economic pressure on web tool providers to provide 

12 competitive solutions which are not only rapidly deployable but also extremely robust 

13 (typically bundling some form of personalization technology), thus as a result of 

14 economic pressure to drive advertising and e-commerce transactions through the site 

1 5 (often through the use of a revenue sharing model which can reduce upfront costs to the 

1 6 vendor substantially), iamworthit through its customer loyalty enhancement capability 

17 provided by community dollars can significantly improve the effectiveness of 

18 personalization at the site (perhaps more significantly than cookies which often are 

19 blocked and all typically not implemented so as to be recognizable across the collection 

20 of sites which the tool provider supplies or the web host services). 
21 

22 It is believed that the revenue sharing model may be structured to cover both the cost of 

23 the tools, web development services as well as web hosting (thus unless he himself 

24 offers a complete turn key suite of solutions and services a tool provider which 

25 integrated the community dollars concept could be in an extremely competitively 

26 advantageous position in attracting strategic relationships with Web developers and Web 

27 hosts. The tool provider may for example sell an ad server module or could 

28 "automatically subscribe" the user with an opt-out option. The very fact that all of the 

29 tool , Web development or hosting provider's sites are exempt from ad blocking/ad 

30 replacement technology (barring iamworthit competitors) would be a significant 

3 1 motivating factor in incentivizing sites to utilize their services (e.g. through promotional 
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1 programs). An emerging large market for Web development is providing these services 

2 to individual end-users. Completely free services in this regard would likely mobilize 

3 what activity in what is now a dormant, albeit enormous commercial market. For 

4 example, tools for developing elaborate Web-wide community portals could have 

5 customized community links which are matched to the user's personal profile. Visitors 

6 to the site (subscribed to iamworthit) could experience an additional layer of 

7 personalization based upon their own user profiles. 
8 

9 Web hosts may also use an additional feature (optionally to significantly drive increased 

10 traffic through potentially all sites on their network. This optional feature is a set of 

1 1 links (e.g. along the side of the user's screen). These links are used to point the user to 

12 other relevant pages issuing aggregate site usage statistics of their visitors (as described 

13 in the parent patent application) iamworthit, user referral links may additionally 

1 4 personalized based upon their user profiles. 

15 13.1.5.4 Vendor coalitions 

16 Vendors may choose to form coalitions, to allow buyers to spend community 

17 dollars at any "partner" site. Vendors that have similar buyer bases can be automatically 

18 identified using collaborative filtering, (i.e. determining similarity with the present 

19 vendor, from the aggregate vendor preferences of a given vendor's subscribers). Also, 

20 these resulting metrics could incorporate predicted online spending by each buyer at each 

21 site. This could help to narrow the selection of sites the vendor wishes to partner with 

22 and/or the selection of these partner sites could be determined and presented to the buyer 

23 to even further narrow the selection for each buyer. All vendors in a coalition advertise, 

24 and provide cross-links and up-links to other vendors. 

25 The coalition model is good for buyers that are more likely to find products that 

26 they want. Vendors can share the risk of advertising, since dollars provided to one buyer 

27 by a particular vendor can be redeemed at another vendor. Advertising and community 

28 dollars increases sales volume at all vendors in the coalition. Furthermore, studies on on- 

29 line buying behavior suggest that on-line shoppers tend to make purchases across a 

30 variety of categories of e-commerce products, thus it is likely that they would also prefer 
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1 the freedom and flexibility which is associated and can be provided in this way only by 

2 deploying a multi-site community dollars scheme. Iamworthit usage statistics are very 

3 effective in identifying prospective vendor coalitions which consist of complementary 

4 (non-competitive) vendors. Alternatively, personalized coalitions which can be created in 

5 ad-hoc fashion for each buyer can further serve the buyer's interests by increasing the 

6 freedom and flexibility which the buyer often strongly desires. Though practically this 

7 can only occur for those (perhaps smaller less commercially significant) vendors which 

8 have not been a major coalition consisting of vendors which many buyers tend to use 

9 which collectively serve a substantial percentage of the overall user base. 

1 0 Thus if the vendors have not established firm partnerships with other vendors, we 

11 can even allow vendors to form dynamic and virtual coalitions within SDI, with a 

1 2 potentially unique coalition of vendors for each buyer. The coalition may consist of an 

13 optimal pool of vendors, as determined by SDI collaborative filtering techniques. The 

14 goal in this model is to provide buyers with a particular "brand" of community dollars. 

1 5 o Multi-vendor Community Dollar Portal 

16 We can allow each vendor to retain an exclusive right to advertise to each buyer; and 

1 7 also develop a portal for the coalition — that gives advertising prominence to coalition 

18 members. Portals will be expected to aggressively promote community dollars. 

19 Buyers that collect community dollars become loyal return visitors to the portal and 

20 its associated vendors. In the case the vendors do not generate the same value we can 

21 provide community dollars in proportion to the value that a vendor contributes to a 

22 coalition. 
23 

24 We can also provide targeted advertisements for the vendors at the portal, using the 

25 user profile to focus ads. The categories and links at a portal (that might include a 

26 search engine) can be re-prioritized (highlighted and/or re-ranked) in accordance with 

27 the buyer's preferences (as described above), and to favor subscribing vendors. In 

28 commercial practice, the motivation for vendors to become coalition members is 

29 largely driven by customer partnerships with providers of e-commerce tools and 

30 solutions. There are many emerging trends by which these intermediary tool 

31 providers could conceivably integrate community dollars. Consider loyalty points 
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1 (e.g. www.mYPoints.com ) or loyalty bonuses programs (e.g. www.clickrewards.com) . 

2 These points or bonuses could be substantially subsidized by the advertising and 

3 deeper benefits passed onto the consumer. Other simpler technology which would 

4 compel these vendors to cooperate include shopping basket technology, the emerging 

5 standard, ECML, common Web tool and/or tool development solutions, common 

6 hosting solutions, common ad delivery systems. 
7 

8 Vendors pay the portal site to advertise, and the portal provides community dollars to 

9 buyers in return for privacy-protected profile information. This model does not 

10 provide incentives for the portal to provide well-targeted adverts, because there is no 

1 1 direct link between a portal's revenue stream and the vendors' sales volumes. 
12 

1 3 A portal with community dollars that can only be spent under a single pseudonym at 

14 its partner sites also provides an incentive to buyers to interact under a single 

15 pseudonym — which in turn allows a portal to profile buyers across its complete 

16 vendor partner network. Buyers will access many sites with the same pseudonym. 

17 The system of SDI allows vendors to leverage the shared profile information as 

1 8 buyers browse web pages and products. 

19 We can also lock buyers into a single portal — and a single coalition of vendors — with 

20 community dollars that "decay" over time, and must be continually replenished. In 

21 this way a buyer cannot pick and choose different portals, and different community 

22 dollars, but can benefit mainly from high web-browsing volume through a single 

23 portal. The value to vendors in terms of consumer lock-in can be considerable. 

24 o Affiliate Network. A coalition of vendors can join to allow a buyer unlimited 

25 access over all affiliate vendors. The program can be sold through the existing 

26 marketing channels of each vendor, as well as through a portal directory of sites 

27 for those vendors. Vendors that join can be required to promote the program 

28 through their own marketing channels. Additionally, perhaps vendors are selected 

29 to cover exclusive physical regions (e.g. in the case of a set of ski resorts), or 

30 exclusive product categories (e.g. in the case of on-line vendors). Vendors can 

31 provide a community dollar- for-real dollar exchange, in return for becoming part 
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1 of a vendor network. Alternatively, perhaps vendors provide an up-front fee, 

2 that can be recovered via dollars spent by buyers at their own site. Each vendor 

3 is obligated to sell the partner network community dollars, but is not necessarily 

4 required to promote the other community dollar vendors. 

5 An affiliate program could be developed by partnering with the affiliate network's 

6 advertising. Typically each advertiser provides a purchasable(s) which affiliate sites 

7 offer for sale on their sites in exchange for a percentage of the resulting sales. In the 

8 community dollars variation, each vendor agrees to tailor an offer for their 

9 products/services on the affiliate network which is subsidized by community dollars 

10 which the user collects after subscribing to iamworthit. An example solicitation 

1 1 could be "receive $100.00 free credit toward future purchases at booksamillion.com 

12 plus $50.00 cash. 

13 o Creation of an iamworthit online multi-store retail site. Establish a retail presence 

14 in a (or potentially multiple) retail niches. The primary business model would be 

1 5 to leverage existing large iamworthit subscriber base (involving the other various 

16 types of commercial partners) in order to dedicate a certain percentage of the 

1 7 community dollars (e.g. thirty percent or approximately one hundred fifty dollars 

18 per customer) which could only be redeemed at that multi-store retail site (and/or 

19 the value of these dollars could be worth more at the retail site). In addition, in 

20 this model, the independent advertising initiative of iamworthit would be geared 

21 towards community dollar credit of that retail site. It should be noted that, because 

22 if other outside competition occurs to the basic iamworthit scheme to a substantial 

23 degree there will not be a compelling incentive for buyers to adopt a more 

24 restricted form of valve (as retail credits at a particular site), versus accepting the 

25 credit from a competitor in the form of cash. Thus this model could provide a 

26 viable means for attaining a leading position in one or more on-line retail markets 

27 if this competition does not substantially exist. 
28 

29 © Marketing Network. Iamworthit sites which offer a community dollars promotion 

30 could, upon the buyers subscribing to iamworthit, additionally offer the buyer 

31 with a down-loadable client based software which provides a small promotion in 
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1 conjunction with a link to iamworthit. Each time a recipient of the email 

2 subscribes to iamworthit, a percentage of the value of that customer is credited 

3 back to the buyer in the form of community dollars. Each subscriber resulting 

4 from the current subscriber's email (though reduced) provides an additional credit 

5 to the original subscriber in accordance with the marketing network business 

6 model. If the site originally delivering the promotion is not an e-commerce site, 

7 a percentage of advertising revenues resulting from the subscriber (and potentially 

8 all resulting subscribers) could be used. It could be applied in the form of 

9 iamworthit advertising (or exchanged) for advertising in an ad server. 
10 

1 1 o Free Web hosting. A portion of community dollars may be allocated in a revenue 

1 2 sharing arrangement between the buyer and the web-site from which subscriptions 

13 to the service are provided, thus enabling Web hosts to offer free hosting services 

14 while also receiving frill payment which is iamworthit advertising supported. 

15 Iamworthit could also easily use its vast statistical data collected form each site in 

16 the hosting network in order to predict which sites tend to be most commonly 

17 linked to from the present site (and/or share "similar content"). This technique is 

18 described in the parent patent application by establishing these links users can 

19 better access relevant information. In one preferred variation, all non- customer's 

20 sites are not included in the referral links. This level of targeting is likely to drive 

21 significant traffic through the Web hosts' network (and even more so if 

22 iamworthit user profiles are transferred to the site upon visitation of an iamworthit 

23 subscriber as these similar cross-links can also be personalized to the user. 

24 13.1.5.5 Transaction-based Revenue-sharing 

25 In transaction-based revenue sharing, the only time that advertisers pay to provide 

26 an impression to a user is when a sale results, in this case the hosting site receives a cut of 

27 the final transaction price. The vendors provide buyers with community dollars directly. 

28 The dollars, which are stored at the portal site, allow buyer-spending to be tracked. This 

29 allows the portal to monitor when a sale occurs, not just a hit on a banner ad. With 

30 transaction-based revenue of this kind, personalization is critical. In this model the portal 
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1 with give prominence to adverts from successful sites. A portal site may forgo payment 

2 from a vendor in exchange for the increased click-through from a strong network of 

3 community-dollar enabled vendors. Value is credited directly to buyers for future 

4 redemption at that particular vendor's site. The community dollars provided to a buyer 

5 can be restricted, such that a buyer can only redeem dollars if s/he maintains enough 

6 visits to the portal site. 

7 Vendors can offer discounts on their own products directly, instead of providing 

8 the portal with money for advertising. The vendor only pays to the extent that its 

9 advertisements are well-targeted. The vendor could also request special ad priority. A 

10 vendor that presents advertisements to a buyer offers the buyer discounted promotional 

1 1 offers for products offered by partner vendors, in exchange for subscribing to iamworthit 

12 and receiving targeted impressions. These offers are in lieu of community dollars, and 

13 can be provided by partner vendors — maybe in exchange for a right to a number of ad 

1 4 deliveries for the vendor' s own advertising purposes. 

15 We can also require that buyers are automatically routed through a portal when 

16 accessing any partner vendor directly. The portal (and therefore the coalition of vendors) 

17 then receives exposure each time the buyer clicks on an ad (or link) to that vendor. The 

18 portal may also provide benefits (e.g. additional advertising prominence) for sites which 

19 are responsible for driving traffic through the portal. Community dollars can be provided 

20 whenever the buyer accesses a site from the portal. Portals can offer free advertising to e- 

21 commerce sites (forgoing advertising fees). The portal provides discounts to buyers that 

22 purchase a product following a link provided at the portal. 

23 A buyer receives the discount by validating a purchase with the portal, and the 

24 site agrees to provide the portal with a share of revenue whenever the buyer cashes in 

25 community dollars in this way (we do not rely on HTTP refer mechanism because that 

26 can be blocked and falsified. Furthermore, we do not rely on URL+extension 

27 correspondences, also not secure — instead rely on providing buyer's with incentives, and 

28 monitoring buyers that have earned community dollars.) 
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1 13.1 .5.6 Delivering per-impression dollars 

2 We can deliver community dollars on a per-impression basis, with vendors 

3 competing to offer buyers high values for being able to present an advert. The existing 

4 collaborative-filtering engine at a buyer's SDI client-level proxy can filter ads, and select 

5 appropriate offers, using community dollars as just another measure of the usefulness of a 

6 message. This is an alternative to providing dollars on a one-off (or even yearly) basis, 

7 for consumption via the vendor's site that the buyer subscribes to the service. 

8 A hosting site can take a fraction of any dollars provided to a buyer. Alternatively, 

9 a site can convert the value into the community dollars to provide to the buyer, possibly 

10 at a preferable basis. The portal might also wish to convert its commission to credits for 

1 1 the buyer at any one of its partner vendors, with the stipulation that the buyer must access 

1 2 those sites via the portal in order to be able to redeem the credits. 

13 13.1.5.7 Providing Loyalty Bonuses 

14 We can use the client-side SDI proxy to provide vendors with "loyalty guarantees", 

1 5 that are credentials to verify that the buyer has executed no transactions with any competitor, 

1 6 under any of its pseudonyms. The client-side SDI proxy is in a unique position to be able to 

17 implement this monitoring, because no other system knows a buyer's portfolio of 

1 8 pseudonyms. The buyer can present its digital credential when visiting a vendor's site. 

19 A vendor may wish to provide loyalty dollar credit; for example, it would be 

20 possible for vendors to offer buyer's credits if the buyer is a 100% loyal customer i.e. that 

21 she/he did not (over a specified period) do his/her purchases at the site of any competitor. 

22 For example, certain types of high value customers could be given considerable value in the 

23 form of credits or discounts as a result of demonstrated vendor loyalty. The credential can be 

24 time-stamped, to prove loyalty. It does not reveal any information about the buyer's other 

25 pseudonyms to a vendor, because many pseudonyms will exist that have not made any 

26 purchases from a competitor. Upon accessing the vendor's site, this credential may be 

27 presented to the vendor. One criteria for the above benefits could be that the buyer may visit 

28 a competitor site, and engage in interactions; however s/he should not transact with that 

29 vendor. 
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1 13.1.6 Alternative Business Models 

2 o Offer discounted or free services in return for the right to access profile 

3 information. 

4 Examples: Free or discounted retail products with "niche" partners in each category; 

5 Free dial-up ISP (as an independent ISP or a service to jointly promote free access 

6 with ISPs); Free Cable and ISP service, Free pay-per-view (note that viewing patterns 

7 and the associated content could provide additional valuable user profile 

8 information); Free phone service (e.g. advertise subscription service on screen phones 

9 or audio ads from pay phones); Free prepaid calling card; Free print media 

10 subscriptions (magazines, newspapers); Free book clubs; Offer any combination of 

1 1 the above with "deep discounts" for each (this can involve $350 community dollars 

12 per buyer or it may simply involve certain purchasing limitations per customer). 

13 Each vertical niche partner in exchange gets exclusivity within their own respective 

14 niches to target advertise to those buyers (e.g. retailers); Free access to sporting 

15 events; Free credit for casinos; Free lottery tickets; Free charity donations; 

16 Discounted hotel lodging; Monetary credit to a credit or debit card (either an 

17 iamworthit branded card or provided as a partnership with the card companies; 

18 Monetary credit to a diner's club; Free subscriptions plus credit to retail buyer's clubs 

19 (on-line or off-line); Credit or discounts for book clubs; Free musical concerts, or 

20 theater presentations, movies or access to arcade entertainment; Free access to 

21 amusement parts or theme parks; Free golf season passes; Free commission fees for 

22 stock trading; Free commission fees for travel booking (if implemented for on-line 

23 buyers would be less compelled to search for travel information on-line though go 

24 off-line to make their bookings). 
25 

26 o Allow the ISP to promote free Internet access through personalized advertising on 

27 iamworthit. 
28 

29 The ad server can even recognize through the associated domain names, the buyers 

30 which are coming from a competitor ISP. So long as that ISP is not a partner of 

31 iamworthit, the associated buyer would be selectively targeted with an offer of this 
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1 sort "free Internet access" by subscribing to iamworthit" Smaller ISP's would be 

2 particularly compelled by such offers to their direct target prospects. This is because 

3 they are operating on a "thin margin". Furthermore, both they and their small 

4 regional counterparts would be particularly vulnerable to this type of advertising by 

5 regional competitors from the same geographical area, during specified period of 

6 months of initial usage of the service, the share of profit due iamworthit could instead 

7 be committed to purchase additional advertising for the Internet service provider (or 

8 the balance of this profit traded-out in the form of additional advertising through the 

9 ad server partner). 
10 

1 1 The ad server partner could further become an exclusive partner of iamworthit on the 

12 following commercial venture: Relationships as established with on-line merchants 

13 and other e-commerce sites. The vendor actively promote an offer to their customers 

14 through both off-line media (using a URL unique to that vendor) and on-line 

15 advertising through the ad delivery partner. The offer may say (as an 

16 example),"receive three hundred dollars in purchasing credit at Books a Million in 

17 exchange for subscribing to iamworthit (or receive five hundred dollars worth of 

1 8 discount credits at Books a Million. In addition, as a further benefit to the vendor 

19 iamworthit could trade its own advertising impressions with impressions on the ad 

20 server for the current offer (in order to reach a different base of buyers which are not 

21 currently subscribed). A particularly compelling industry for this application is on 

22 line travel inasmuch as a plaguing problem to this industry is the fact that many 

23 buyers use travel sites as an informational resource on available travel deals and 

24 packages, but ultimately book their trips directly through a travel agent (thus cutting 

25 out the travel site). A three hundred dollar a year travel credit would be a compelling 

26 incentive to many buyers to modify their current travel booking habits. Affiliate 

27 networks are also an ideal channel for these types of promotions because affiliate 

28 sites agree to participate (typically) purely based upon the degree of the profit sharing 

29 opportunity (which would be significantly larger than most types of transaction - 

30 based affiliate advertising). 
31 
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1 Alternatively, it would be possible to offer websites the ability to become Internet 

2 service providers where the interface to the ISP home page would essentially be 

3 heavily branded to that site or portal. Companies like GTE already offer a "Virtual 

4 ISP" service in which the content to the ISP home page is unique to the ISP while the 

5 network is provided by the virtual ISP service. This model would be particularly 

6 compelling for sites which are largely community oriented and have a potentially 

7 loyal customer base. Moreover, interestingly, many of these community sites are 

8 offering many of the services and capabilities that a full-blown ISP would offer from 

9 its home-page, e. g., a portal interface, links to high-quality content, chat/forums, e- 
1 0 commerce, commerce affiliate links, etc. 

11 

12 o Free Community-based Content. 
13 

14 Create premium content which is free to iamworthit subscribers, subsidized by 

15 revenue from profile information. Each iamworthit buyer would be granted free 

16 access privileges to the premium content on all sites which are part of the program. 

17 Some content may be purchased and/or reusable, other may be entirely site-specific 

18 and novel. This model would be particularly appropriate for community sites which 

19 are largely member-based (or for example ISP-member-based communities) where 

20 much of their value to members is based upon information and other content which it 

21 can provide. It is conceivable that all iamworthit-enabled community sites would 

22 enable free access to their content by all other iamworthit customers (though it is 

23 possible that they may be mutually restricted if members of competitor communities 

24 as desired by the community site). If an ISP service is not already provided, a virtual 

25 ISP service could additionally be offered at a substantially reduced price or possible 

26 free. One could imagine further extending this present network of free content to free 

27 content iamworthit subscribers for free access to fee-based television programming or 

28 VOD services. In as much as community sites and television channels are becoming 

29 different media for delivery of the same information as the number of channels 

30 increases, VOD becomes technically enabled, and, most imminently, full motion 

3 1 video can be delivered upon demand over the Web. 
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1 

2 o Free Access To Subscription and Fee-for-use Information of the Web. 
3 

4 In addition to the aforementioned free community site content, it would be possible to 

5 further provide free and automatic access to fee-based information on the Web. 

6 These costs may be able to be covered by the model across all or most sites 

7 depending upon the usage characteristics of its buyers (e.g. assuming 

8 advertiser/community dollars payment to the sites are averaged across buyers in 

9 accordance with the consumption patterns of the average iamworthit buyer). The 

10 identity (pseudonymous) of the buyer would have to either be disclosed to the site via 

1 1 the proxy or a unique pass code (as required by the site) provided to the buyer could 

12 be automatically entered upon the buyer accessing the fee for use area requiring the 

13 code. A directory (portal) of these fee-based sites would be a useful adjunct to 

14 subscribers. 
15 

1 6 o Bundling iamworthit links with hardware with a PC manufacturer. 
17 

1 8 Many PC manufacturers are now recognizing e-commerce as a very important sales 

19 channel. The present model would involve the P.C. manufacturer bundling a link 

20 along with a promotion for iamworthit. The promotion would offer the buyer cash 

21 credit for the buyer. The PC manufacturer would also receive exclusive advertising 

22 rights to target buyers whose browsing behavior profile qualifies them as a future 

23 sales prospect. E.g., instead of cash credit as the profile is identified, the offer could 

24 then become modified to offer free hardware or credit to their purchase. Because PCs 

25 are highly portable, the advertising targeting techniques described in LEI A could add 

26 substantial additional value to advertisers. A similar model could be used for 

27 manufacturers of PDAs. 
28 

29 © Allow matchmaking of buyers based on profiles across a network of 

30 iamworthit/community-dollar sites. 
31 
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1 A menu of different forums and chats can be displayed on each iamworthit member 

2 site, (the underlying methodology for which is described in co-pending patent 

3 application "Virtual Community Service for System for Customized Electronic 

4 Identification of Desirable Objects"). In accordance with this specification, a 

5 variation of the service involves the process for identifying individuals who most 

6 closely match a given category or target object. For example in the context of the 

7 present implementation a category or content, merchandise or a purchasable being 

8 specially promoted may be the focal point of a discussion forum or chat room, which 

9 is automatically organized by the Virtual Community agent. Accordingly, a portal (or 

10 in accordance with the present trend) a site with which a portal interface is integrated 

1 1 utilize the present techniques for generating virtual communities for each category or 

12 sub-category of content on the portal or for direct access into a forum or chat room 

13 which was automatically created around that particular site (as the target object used 

14 as the matching criterion). As described, the user may navigate a hierarchical menu 

15 of virtual communities which may be constructed automatically according to the 

16 methods described which involves communities assigned to category, sub-category, 

17 and association with corresponding sites. Ideally in this scheme the portal is actually a 

1 8 "virtual portal" which may be utilized in providing access to the communities across 

19 numerous sites (and/or ISP home-pages). Users may also be navigated (at the 

20 individual user level) which along with their pseudonymous user profile data is 

21 subject to their data release policies. In a variation of the above schemes, if there is 

22 geographical information which is associated and which is released in accordance 

23 with the above individuals and/pr communities (e.g., as may be occurring or 

24 scheduled to occur in physical space), LEIA may be employed as a primary (or 

25 additional) selection criteria for navigating the present information accordingly. 
26 

27 o Advertising in Exchange for Equity. 
28 

29 A potentially attractive optional form of value, which could be provided to 

30 iamworthit customers involves equity shares in companies which advertise to the 

31 buyer (in lieu of community dollar credit or cash). This scheme is an ideal 
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1 application for iamworthit in as much as iamworthit customers can be highly targeted 

2 and because many Internet-based start-ups are highly niche community oriented (thus 

3 iamworthit customers who are interested in the sites can be efficiently identified and 

4 targeted). Moreover advertising is typically very expensive which in the absence of 

5 accurate targeting may be of questionable value. It should be noted, however, that 

6 because the primary objective is to both find viable prospects and to engender an 

7 element of loyalty (which the equity model does). This scheme would be the 

8 preferred approach to advertising for sites which do not sell on-line where community 

9 dollars would be the preferred loyalty engendering scheme. In order for this model to 

10 substantially provide its desired advantages of increased advertising exposure to 

1 1 fledgling web based companies, the iamworthit subscriber base would have to be 

1 2 quite substantial. 
13 

1 4 o Loyalty credits for off line retailers. 
15 

16 Deliver through the back of sales receipts, kiosks or direct mail or on-line substantial 

17 purchase credit to retailers (e.g. grocers') customers, using the aforementioned 

18 technique of utilizing a unique URL to identify the vendor and/or promotion from 

19 which an iamworthit subscriber originally accessed the iamworthit subscription site 

20 (thus identifying for both buyer and vendor the appropriate denomination and/or 

21 terms of community dollars issued to the buyer). In the preferred implementation, a 

22 loyalty card is used to identify the buyer thus enabling the community dollars value to 

23 be provided to the customer at check-out as straight credit or possibly an 

24 enhancement to loyalty credit. The buyer may also be identified via credit card or 

25 alternatively a voucher (or coupon) could be printed from the buyer's computer or 

26 from a kiosk which is typically situated near the entrance to the store and which could 

27 be activated upon insertion of a loyalty card credit card (or associated authorization 

28 code) and could also be used to disclose the buyer's community credit balance. A 

29 unique identifier for that voucher or coupon is provided and non-tamperability 

30 measures are provided such that the buyer's community dollars account can be 

31 appropriately debited upon redemption. Preferably, a pre-determined value is 
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1 specified on each voucher (which could be predetermined by the service or the buyer) 

2 or alternatively, the total community dollars balance could be specified on the 

3 voucher along with the buyer's name/address and redeemable only upon presentation 

4 of valid buyer ID. 
5 

6 o Free ASP Services - Web-centric applications are becoming an increasing central 

7 part of the e-business infrastructure. iamworthit could potentially enable 

8 organizations to gain free access to these Web-centric applications in exchange 

9 for iamworthit enabling their employees and customers (e.g. which may use some 
1 0 of these applications as well). 

11 

12 o Credit to a User's Credit Card - Many credit cards are tailoring promotions to 

13 enhance not only acquisition of market share but also loyalty of its users. The 

14 user-centric SDI provides an enhanced (e.g. rebates for expenditures) platform 

15 which could provide more efficacious loyalty enhancement marketing strategies 

16 for card issuers. For example, a credit card user could be given certain credit 

17 towards the card (e.g. via a promotional offer) for becoming an iamworthit 

18 subscriber (which is largely ad revenue supported). An additional novel 

19 application could involve providing an offer for an additional benefit which could 

20 be provided if the user performs all of his/her on-line transactions using the 

21 present credit and (i.e. wherever the card is accepted). 
22 

23 An iamworthit pseudonymous credit card [e.g. LMP 94] could be provided whereby 

24 users collect credit for advertising (e. g. ? on billing statements for the ad) which the 

25 user receives as well as the advertising delivered via iamworthit's on-line (Web or 

26 targeted email) as well as pseudonymous physical mail, telemarketing calls using user 

27 profile pseudonyms and one all time or pseudonymous phone numbers. This user 

28 profile data consists in part of off-line data and on-line data. 
29 

30 o Free Personal Portal for Individuals. 
31 
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1 It is reasonable to offer individuals completely free, Web design/development and 

2 hosting services which are offered and mass marketed. At the time this patent was 

3 written, Web developers were offering such free services in exchange for revenue 

4 sharing for advertising product syndication or e-commerce offering on the 

5 individual's site. Iamworthit could conceivably be deployed in conjunction with this 

6 commercial model whereby either a portion of the user's own iamworthit profiles are 

7 used to subsidize (or subsidize in part) the portal and the developer and/or revenues 

8 shared from profits from iamworthit subscribers who subscribed from the portal are 

9 utilized also (or instead). As such, it is also reasonable for such a developer to 

10 provide links on affiliate sites (e.g. access an affiliate network) which offers free 

1 1 individual portals in exchange for subscribing to iamworthit (and perhaps agreeing to 

12 offer a similar iamworthit expectation from that portal as well from which the user 

13 can also share in a portion of the revenue stream. Such a program could even be 

14 structured in its revenue distribution to the subscribers as a multi-level marketing 

1 5 network. Users could even receive value for providing links to the vendors site either 

16 both (e.g. as "cob- — shopping portal) direct off-line 

1 7 encounters in the other users as prospective customers to the URL. (where the URL is 

1 8 unique for each individual. 
19 

20 o Multi-level Marketing Network. 
21 

22 This economic model for distribution of shared revenues from iamworthit may be 

23 implemented several ways. For example, as above suggested, the user's Web portal 

24 offering solicitations for iamworthit (as a standard offering or in combination with a 

25 free personal portal), could be implemented in a number of different ways, such as ; 

26 a). The solicitation is presented upon the user's Web portal, b). The solicitation is 

27 made through the traditional sales channel of direct one-o-one correspondence with 

28 friends or associates, c). The solicitation is attached to any email messages of the 

29 user such solicitation provides a link to enable the user to subscribe. 
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1 13.1 .6.1 Agent-Mediated Value Exchange in the Supply-Chain 

2 The information in the SDI data warehouse can allow vendors to identify business 

3 relationships between other vendors, for example between vendors and their suppliers, 

4 and between vendors and their customers. With this information a vendor might attempt 

5 to exert pressure on another vendor by making direct offers and incentives to their 

6 suppliers and/or their customers. Of course, in all cases information is only available 

7 subject to price and disclosure policies, but with there being at least two parties in any 

8 deal no vendor can unilaterally prevent this kind of information from reaching the SDI 

9 enabled data market place. 

10 Information may include the benefits/disincentives associated with particular 

1 1 actions, and help to identify actionable events. Agents might offer other vendors a share 

12 in some future profits that result from taking a particular action, so that little up-front 

13 collateral is required. The information in the data market can be used to estimate the 

14 value of this share, for example, based on information about expected trade volumes from 

1 5 projected market share. 

16 In a variation, SDI may also assume the task of negotiating on behalf of each entity 

1 7 involved. This is useful when information is sensitive and/or confidential, SDI can allow 

18 negotiation without identification until a deal is struck. In this section we describe an 

19 application of SDI to an agent-based economic infrastructure in which value exchange is 

20 used to create "efficient supply chains and contracts". 

21 It is critical within this system that agents can specify quite elaborate privacy 

22 policies, so that they can submit sensitive information to the SDI data warehouse but be sure 

23 that the information will not be revealed to competitors. One consequence is that SDI will 

24 compute efficient supply-chain structures based on the information provided by agents, but 

25 be able to report a good set of contracts without providing complete rationale (e.g. plans, 

26 strategies and or future technologies) for the basis of the identified solution. An agent may 

27 be asked to apply economic pressure within a supply chain without understanding why it 

28 will receive personal (and perhaps long-term) benefits. In another variation, if SDI is able to 

29 keep track of an entity's competitors and cooperating partners, then certain amounts of 

30 information can be carefully released to justify decisions. 
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1 SDI must retain a considerable amount of autonomy, to apply its knowledge to act 

2 in the best interest of these entities which it represents without being able to disclose the 

3 rationale for their contract recommendation. There is a very significant amount of trusted 

4 information regarding all of the commercial activities, internal operations, marketing , 

5 strategic business and product development strategies, etc. which must be disclosed to 

6 SDI for an accurate assessment regarding the present and predicted commercial impact of 

7 new supply-chain contracts. 

8 Furthermore, we must prevent a vendor from exaggerating its value of certain 

9 outcomes, to achieve a good solution. Given this incentive-compatibility problem^ and the 

10 potential problem of vendors entrusting so much extraordinarily delicate information to a 

1 1 single entity, we could instead allow SDI to assume many of the roles of a consulting 

12 firm. In this case SDI controls the reporting of information to SDI, and can verify its 

13 accuracy. 

1 4 The mechanism for providing value to the recipient (in exchange for providing the 

15 desired action) is flexible and may include (as suggested above) "upside" value, direct 

1 6 compensation (or direct "redeemable value"), or an exchange of actions between agents 

17 (I will do X if you will do Y). Part of a deal may be that the provider of a requested 

1 8 action guarantees exclusivity and that it will not provide a similar action for competitors. 

1 9 A vendor might identify and propose deals to the customers of another vendor, to 

20 apply economic pressure through threatened boycott and compel the vendor to perform a 

21 desired action. This "indirect" economic leverage may be applied at the consumer 

22 (lowest) level in the supply chain, and passed upward. In addition to boycott, a vendor 

23 could promise please note, shouldn't consumers be termed customers of the vendor since 

24 consumers cannot be threatened with boycott) consumers credit in return for taking a 

25 particular action. SDI can mediate the entire electronic control, transfer, fulfillment and 

26 negotiation processes (typically XML-based) individually on behalf of each entity within 

27 this multi-step negotiation process. In the above applications SDI can represent each 

28 entity in the negotiation process, and also have a role in providing information and 

29 statistics, expert opinions, etc. which clearly elucidate the value and potential value 

30 opportunities available to the entity(s). 
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1 The current model allows different supply chains to compete for the loyalty of the 

2 consumer, e.g., through providing indirect economic value to the consumer as in kind 

3 products/services and/or upside in the benefiting entity. 

4 Economic alliances can be created between vendors, suppliers and consumers 

5 within the same supply chain. SDI can act on behalf of different supply chains and 

6 establish a competing market between different chains for the business and ultimate 

7 loyalty of purchasers. End purchasers have a large economic leverage on a supply-chain. 

8 The decision of an end purchaser will depend on: 
9 

10 1 ) The type, selection and quality of personalized products/services 

11 

12 2) The economic value which a given supply chain is willing to provide a user in 

13 exchange for complying with particular requests (i.e., favors). This factor may 

1 4 ideally translate to price or loyalty credit. 
15 

16 A part of a consumer's decision may be based on which vendors are likely to use 

17 personal data effectively in the future, to provide for example highly customized 

18 products/services for the individual. Supply chains which can deliver products and 

19 services which are most closely aligned to the needs/preferences of the customer should 

20 ultimately win out by capturing the customer's loyalty. A consumer's personal data can 

21 be quite valuable to a supply-chain, especially if a consumer provides one supply-chain 

22 with the exclusive right to its personal data. In application to the virtual sales person 

23 scheme, customers can receive incentives to solicit other customers as introduced 

24 dynamically via SDI and LEIA [Patent No. XXX], customers that match desirable user 

25 profiles. The payment that a customer receives for committing to a supply-chain can be 

26 received from all members of the supply-chain that can gain value from the customer, 

27 with "considerations" passed down the chain which are eventually provided in some form 

28 of loyalty credit to the customer by the vendor at the end of the chain. Some of the value 

29 conveyed back to the user can, of course, be in the form of community credit, e.g. 

30 subsidizing a portion of the customer's needs from vendors within a supply chain. 
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1 In this model we can also allow an employee to receive benefits to reflect the 

2 value that s/he provides within the supply chain. Again, a vendor's supplier might even 

3 provide a benefit to encourage the employee to perform well. Valuable employees may 

4 receive more "value bonuses" the longer they stay with the organization, to encourage 

5 them to stay with an organization which needs them, and without the ability to leave a 

6 company once the value of share options has been realized. 

7 With interwoven supply-chains, for example, with the same supplier connected to 

8 multiple vendors, and vendors with competing suppliers, this method of passing value 

9 along the supply chain becomes more tricky. When a consumer applies economic 

1 0 pressure on a vendor that it deals with in the supply chain, that vendor may inform other 

1 1 suppliers of the pressure and block service to the user. A bartering system can provide 

12 additional benefits, to allow vendors to exchange bonuses, discounts, etc. with favors 

13 offered to vendors in other value chains. In general it is quite likely that a favor will not 

14 substantially benefit a user in another supply chain. We can also include a method to 

15 prevent a vendor that does not itself offer favors in the supply chain from taking 

16 advantage of favors offered by other vendors. This can support cooperation within the 

1 7 supply chain, and allow effective competition with other supply chains. 

1 8 Each vendor in the supply chain can benefit in two ways: 
19 

20 1) They receive a "commission" on the transactions in which they convey the favor 

21 to their customer (which again may be "upside" in the ultimate beneficiary of the 

22 favor and/or product/service loyalty credit with their own supplier). 
23 

24 2) (Most importantly), they receive customer loyalty advantages via the community 

25 credit they provide such customers. 
26 

27 For reasons of this latter benefit they are likely to compete with other competitive 

28 vendors for the privilege of exclusively delivering the incentive to evoke the favor to the 

29 next level up the supply chain. In this way each vendor below the supplier requesting 

30 stands to gain via customer loyalty benefits by competing with another supply chain. 
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1 A consumer can provide value to a supply chain by exclusive provision of 

2 personal data, to allow a supply chain to create specially customized and targeted 

3 products and services. However, it is not necessary for a customer to have an exclusive 

4 contract with a supply chain (although this might bring greater loyalty benefits and 

5 payments). 

6 A variety of different techniques and methods should be integrated and deployed 

7 to implement this agent-mediated supply chain value exchange system: 
8 

9 1) Digital contracts to define and enforce particular transactions and long term 

1 0 business relationships. 
11 

12 2) An SDI infrastructure to suggest particular opportunities for a vendor, based on 

1 3 the benefits to the complete supply chain. SDI can utilize as much information as 

14 the vendor is willing to entrust in SDI, to try to identify present and predictive 

1 5 future benefits resulting from actions on the part of other vendors. 
16 

17 3) A means for enabling customers and/or vendors to form coalitions, to strengthen 

1 8 their ability to provide pressures on other vendors. 
19 

20 4) A mechanism with useful incentives for these customers and vendors guarantee 

2 1 direct benefit even if the desired action ultimately does not result in new value, 

22 e.g., an insurance scheme. 
23 

24 5) A method to exchange terms and conditions between members in a supply chain, 

25 by being passed up the supply chain from the lowest level (consumer), and a 

26 mechanism to provide "fair compensation" for this process. 
27 

28 6) A method to integrate loyalty benefits and long-term purchasing contracts, etc. 

29 into an electronic cash protocol, so that buyers are aware of all contractual 

30 constraints during negotiations with agents. 
31 
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1 7) Methods within SDI to advise a customer about an appropriate bidding strategy, 

2 based on estimated values of different contracts and supply chains, to help the 

3 customer to make a good decision. 
4 

5 8) Methods within SDI to recommend information to reveal to a supply chain, to 

6 maximize the value of a loyalty bonus offered and ultimately received by an 

7 agent. Of course, information cannot be falsified, but some information can be 

8 withheld. 
9 

1 0 Applications of "bribing" the consumers of a vendor: 
11 

12 o A commercial entity is sustaining bad publicity from an article published in 

13 newspaper X. The company could offer the customer base of the publication 

14 (identified via SDI) an incentive to temporarily disrupt or boycott the publication 

15 until corrections/changes are made to the article. The company could also offer 

16 discounted products or services to the consumers, based on information (from 

17 SDI) about their consumption patterns. A more indirect discount could be 

18 provided via other vendors further up the supply chain, via multi-vendor 

19 exchange. 
20 

21 o Company A is a high tech start up selling software products and services to 

22 established vendor B. Vendor B identifies the substantial value proposition of the 

23 products and services provided by the start up, and decides that it should provide 

24 the services for itself "in-house". Start up company A can go to the consumers of 

25 vendor B (information from SDI) and offer a percentage upside, e.g. percentage 

26 equity in the start up, if they will apply a threat to boycott the vendor unless the 

27 vendor agrees to do business with the startup. Company A might even identify 

28 consumers or suppliers further down the supply chain, and apply indirect pressure 

29 to B. 
30 
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1 o Company A is developing a new commercial initiative which may be competitive 

2 to that of another smaller company B, and might harm B's market share. Assume 

3 that consumers prefer B's products to the future products of A, and that B can 

4 make more profits than A. Company B could convince the customers of company 

5 A of these facts, offering them a percentage of potential upside and/or discounted 

6 or value credit towards products. The customer profiles of company As 

7 customers also reveal the vendors from which they most frequently purchase, and 

8 company B could also offer equity or otherwise some potential upside to these 

9 vendors in exchange for offering discounts or credits to their customers (which 
10 are customer A's customers as well). 

11 

12 In place of monetary compensation, we might provide a value payment to an agent in the 

13 form of a long-term compensation, based on the long-term value achieved by an agent 

1 4 further up the supply chain. 
15 

16 o SDI can identify the paths in the supply chain between companies, to allow a 

17 sequential transfer of benefits down a chain. Ideally, each entity providing the 

18 benefit is doing business with another next lowest in the supply chain, so that 

1 9 value can be exchanged. 

20 o SDI might "bribe" each party within a value chain to enter into a contractual 

2 1 relation which benefits that party (directly or indirectly). 

22 o SDI can determine what companies along this chain would have the greatest 

23 future potential to benefit one another, i.e., where the supplier can generate 

24 compelling long term benefit to each of his subsequent buyers at each level down 

25 the chain. 

26 o SDI can use available information regarding a vendor's internal and external 

27 activities and goals, to identify optimal multi-vendor supply chain models, in 

28 terms of maximizing the sum value to all members in the supply chain, and the 

29 value of each individual company. 

30 o SDI must also consider the willingness of each vendor to establish the necessary 

3 1 contractual relationships for the new supply chain. Vendors might provide SDI 
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1 with "relationship policies" and "contract policies", to allow this to be factored 

2 into an SDI recommendation. 

3 o SDI models, in detail, the economic value which each scenario provides to the 

4 collective vendors and to each individual vendor. SDI can inform each individual 

5 vendors of its own predicted value, while withholding information about other 

6 vendors. 

7 o SDI can seek coalitions of vendors with similar interests to combine their 

8 bargaining power, and attempt to compel vendors to enter particular preferred 

9 value chain scenarios. 
10 

1 1 SDI is privy to most or all of the information relating to all of the companies in a supply 

12 chain, and can be trusted with the task of creating strategic plans between different 

1 3 companies, potentially making the supply-chain more efficient and benefiting all vendors. 

1 4 The strategic plan might suggest: 
15 

16 1 ) New business relationships between the most synergistic entities 
17 

18 2) Contracts for existing commercial relationships which embody and enforce 

19 specific terms of those relationships within the framework of this optimal supply 

20 chain system. 
21 

22 SDI might also create a detailed system for accounting for the effect of a contract 

23 on the rest of the supply chain (individually and collectively), and predict short and long- 

24 term effects of contracts in advance. This information can be used to encourage the 

25 desired contractual activities on the part of vendors in a chain. There may also be cases 

26 in which the direct/indirect economic impact of a contract upon a vendor results from an 

27 additional secondary economic advantage to that vendor besides the trickle down 

28 economic effect; for example because of strategic commercial benefits. These factors 

29 can also be considered in an economic benefit model. 

30 Given models to compute the comparative economic value of proposed contracts 

31 to vendors in a supply chain we can demonstrate a "trickle down" economic value chain 
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1 in which the success of company A is directly related to the success of company B. This 

2 information can be presented to company A and each intervening supplier. SDI can then 

3 establish long term contracts between each entity in the chain to insure that each 

4 company receives the benefits of products and services from its other suppliers which are 

5 of greater value than without the contract. This trickle-down benefit can be used to entice 

6 a company to enter into a recommended contract, for its own benefit and the benefit to 

7 company B. 

8 The benefits which each vendor is predicted to receive in a proposed value chain 

9 can be pooled together and used to "bribe" each vendor within the value chain to enter 

10 into the preferred contract. While some value chains may actually be willing to provide 

1 1 greater compensation to a vendor, the goal is to link a vendor's decision to the value of 

1 2 the complete value chain, both future and present. 

13 Of course, vendors may face risks in committing to long-term contracts, 

14 particularly when many vendors in the chain must prosper for the value to be realized. 

15 SDI might introduce a number of decommitment clauses to allow a vendor to back out of 

1 6 a plan, perhaps reverting to a "plan B" which protects the interests of harmed vendors 

17 while retaining as much value to other vendors, i.e. allowing all vendors to contribute to 

1 8 the cost of failed contracts. Alternatively, we might create a futures market in which the 

1 9 risk of the future value can be traded in real options, within a public market — providing a 

20 financial instrument to share risk. In this case, if some measurable form of economic 

21 benefit such as sales value (for e.g., the particular relevant product line), possibly 

22 stock values, profits, etc. falls below a target level the vendor may be compensated for its 

23 initial risk, i.e., secure a worse-case outcome. 

24 It may be important in high-growth markets such as technology, with high risk 

25 companies, to allow insurance for contractual non-performance of suppliers. 

26 SDI can leverage available data to compute the most beneficial economic scenario 

27 for all associated vendors collectively, and also to compute optimal scenarios for each 

28 individual entity. SDI can disclose the complete picture to each agent, and then allow 

29 individual vendors to optimize their local plans in a global plan. To effect a new scenario 

30 though, agent agreement is required. Another mechanism could have SDI report the 

31 optimal global scenario, and the optimal scenario for each individual agent. Then we 
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1 could allow agents to negotiate, to find a multi-agent compromise outcome that is more 

2 efficient than the current solution but possibly not as good as the globally optimal new 

3 supply chain. The decision of an individual agent to enter into a proposed contract will be 

4 determined by the cumulative value, short term and/or long term, to that entity, consisting 

5 of the predicted value of taking an action (e.g. breaking a contract, initializing a new 

6 contract) both in terms of immediate bonus to a vendor, long-term individual benefit, and 

7 trickle-down benefit because of global supply chain improvements. 

8 Each vendor must provide SDI with as much information as possible about the 

9 particular conditions under which it will perform new contracts, such as its preference 

10 between short term and long term predicted benefits, etc. SDI looks for outcomes that are 

1 1 favorable to individuals and to the overall system. Value can be realized in short-term 

12 payments and long-term bonuses. 
13 

14 Additional examples include: 
15 

16 o Company A may threaten to file a lawsuit against company B (which could be 

17 very damaging to company B). In a similar barter exchange, company B can 

18 barter its own value (in the form of upside or products/services) as offered to the 

19 customers (or vendors of the customers) of company A, targeted to only those 

20 customers that reside within the same supply chain. Thus, by limiting the value 

21 proposition to only these entities, and not to those which reside within the supply 

22 chain of a competitor, the value of the potential upside is worth more because it is 

23 partially predicated upon their own direct success. Again, it is also possible to use 

24 an indirect strategy, bribing entities or individuals of influence within a given 

25 company from which a particular action is desired. 
26 

27 o In another novel application there are certain activities among certain individuals 

28 which are considered to be mutually and collectively advantageous or 

29 disadvantageous to all members of a certain vendor community. We can have all 

30 members contribute money into a fund. The desired actions would be rewarded 

31 appropriately while these not desirable are punished. If/when an undesirable 
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1 action is performed all vendors can threaten to boycott that action (creating a 

2 significant disincentive to the user). The individuals may also be subject to 

3 boycott by other entities which the present individuals rely upon economically (as 

4 buyers or sellers). 
5 

6 o Company B is a large entity discussing a major partnership deal with company C 

7 which involves a large development initiative to supply company C with 

8 technology. SDI is confidentially apprised of the negotiation via company C and, 

9 in accordance with its privacy policy, allows SDI to notify and apprise company 

10 A of this developing prospective deal. Company A (a smaller company) initiates 

11 a counteroffer to company C, and is willing to share a portion of its upside 

1 2 (resulting from the deal directly or from its own equity) to customers of C if they 

13 can convince company C to deal with company A. SDI may finally suggest a 

14 boycott or reduction in business unless company A is selected. 
15 

16 o A high tech start up is in the process of seeking new or additional funding. SDI 

17 can identify a potential customer base and collection of strategic partners, and 

18 also likely indirect beneficiaries of the technology, products and services in the 

19 supply chain. These entities and their consumers can then be incentivized to place 

20 economic pressure on the necessary financing institutions. The commercial 

21 entities themselves may also be incentivized, for example potential direct 

22 suppliers or customers can be informed that investment will provide an 

23 opportunity for new guaranteed business or value concessions. These entities 

24 might also apply direct pressure on their own financing institutions. The scheme 

25 extends to existing suppliers and customers. 
26 

27 o An entrepreneur with a new start up venture can find companies (e.g. suppliers, 

28 etc.) that will benefit from the new company, and seek funding from those 

29 entities, in return for providing a long-term share in the company. SDI can 

30 contract all the relevant strategic partners before a funding commitment, to 

31 provide more information and reduce risk. Other equity shares could flow from 
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1 marketing agreements, e.g. a co-branded affiliation of all further 

2 products/services. The new company might also guarantee an exclusive contract 

3 with another entity. An entrepreneur might also be willing to sell a company to 

4 the more established entity, e.g., within a year or two, this can be contracted at the 

5 outset. The present framework allows value from long-term strategic alliances to 

6 be released. 
7 

8 Within SDI we can adapt a user's personalized portal browser to favor the 

9 vendors providing such value opportunities to the consumer. A vendor could achieve 

10 additional market share by complying with requests of another vendor. The value 

1 1 proposition as presented to the user can leverage the personalized browser user interface 

1 2 to prioritize a vendor's offers in the future. 
13 

1 4 Example Commercial Application. 
15 

1 6 An investor has invested in a very high risk venture relating to highly speculative 

17 stem-cell research. After years and millions of dollars, the venture finally goes out of 

1 8 business (or becomes acquired on the verge of bankruptcy) the investor loses all of her 

19 investment, however terms in the original investment contract (which involved at the 

20 time the approval of a variety of medical organizations most likely to benefit from the 

21 eventual medical science benefits of the research) now enable the investor to acquire in 

22 kind benefits (which may include anything from equity in these various medical 

23 organizations to goods/services provided by them). A portion of the value may even 

24 include a sizable amount of value provided by the original company and/or indirectly vis- 

25 a-vie the remaining medical entities, a substantial amount of value within the barter 

26 exchange for favors. 
27 

28 With time, stem cell medical applications proliferate thus, the t4 upside benefit" 

29 increases in proportion over time. However, if this "up-side benefit" (of the original 

30 collection of contracts from the various relevant companies in the field to which the 

3 1 original research activities pertained) was insured through the futures market the investor 
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1 can benefit from the value and growth. Investors can include researchers whose 

2 contribution of time and effort and dedicated focus on the problem had been estimated. 

3 This can allow valuable talent to receive the incentives to join the initiative from the 

4 outset in a way which would be very difficult to achieve with other standard recruiting 

5 approaches. It is important in this scenario, however not to over incentivize the managers 

6 and executives such that they may lose incentive to make the company succeed. In this 

7 scenario that the company does not go bankrupt, proper incentives could be provided to 

8 further encourage similar scientific and innovative entrepreneurial initiatives in the 

9 future, thus a portion (perhaps a fraction of the upside in these other entities could be 

10 provided even in the event of success (and irrespective of the level of risk associated with 

1 1 the original venture). Certainly the field of stem cell research is only one example of 

1 2 many speculative new technology fields with potential for enormous overall impact upon 

1 3 all aspects of the economy and society. Another example cited elucidates other potential 

14 features and variations of the present novel scheme. The field of na — technology is 

15 believed by many to become perhaps the most important and pervasive technology 

16 paradigm of the twenty-first century impacting almost every industrial sector. As such, 

1 7 many different critical problems will need to be solved, each with potentially enormous 

18 associated economic opportunity. There is, however, considerable risk associated with 

19 the probability for industry and research institutions to realize these significant returns on 

20 investment for any given industrial sector or application domain (however, by no means 

21 is this true for the overall field of nanotech). Also breakthroughs in any one application 

22 domain will substantially advance the state of the art for potentially all other application 

23 domains. It is reasonable for.SDI to create a futures market in which it predicts certain 

24 long term but very realistic goals for each application domain of nanotech. Through 

25 cooperation with industry and the investment community it may further pre-contract with 

26 various present and potentially future entities which are most likely to be direct 

27 beneficiaries of the technology which is spawned from the initial extremely high risk 

28 commercial ventures pioneering the basic technology. As in the stem cell commercial 

29 example percentages in the various longer term beneficiaries can be used as a strong 

30 inducement to present prospective investors in the pioneering commercial entities. As 

31 suggested, this percentage may be inversely related to the degree of success of the initial 
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1 venture and if desired, the risk may be further reduced through the use of a futures market 

2 which essentially assures the predicted economic marks of these various 

3 prospective commercial endeavors. The wise investor in the futures market will seek 

4 diversification of investment across many different technological sectors of nanotech 

5 (thus assuring the large upside which is certainly to occur universally but not necessarily 

6 for any one given application commercial domain. SDI could further mediate the long- 

7 term opportunities to the original investor by mediating, for example, technology transfer 

8 and licensing intellectual property to present or prospective commercial entities which 

9 have thus agreed to operate under the terms as provided by SDI. In exchange, SDI could 

10 also (as above suggested) provide a resource and technology sharing service and 

11 associated partnership brokering service (between potential multiple companies 

12 containing potential commercial synergies). In addition it may work with the investment 

13 (particularly incubator community) by which it may, without disclosure, of commercial 

14 strategies and technologies which are presently or prospectively planned across the entire 

1 5 nanotech industry of disclosed suggestions for particular commercial and technological 

16 opportunities which would appropriately complement and benefit the global strategy or 

17 nanotech which is known exclusively buy SDI. This approach in addition to emulating 

18 the same united focus and integration of multifaceted technological and R and D 

19 initiatives can also ensure the proper distribution of efforts within each technical 

20 application domain, the appropriate sharing of information whenever potential for 

21 technological complementarity, but not commercial competition exists (which SDI) 

22 strives to achieve on an industry wide basis) and the assurance that companies and 

23 departments do not duplicate their efforts if at all possible. The investment community 

24 through SDI could certainly stimulate conditions as above which band aid the adherence 

25 to these terms as a condition to investing. SDI, through incubators could additionally 

26 disclose particular commercial and technologic opportunities which are both synergistic 

27 to the global SDI strategy as well as pre determine (visa vie SDFs extensive in depth 

28 research on the industry to possess significant commercial opportunity. Based upon 

29 some existing companies technical proficiency and commercial strategies, some of these 

30 emerging opportunities could be disclosed as potential (suggested extension of existing 



Page 226 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 departments or divisions based upon their individual areas of expertise and particular skill 

2 sets of its employees. 

3 The present system methodology (of business to business and business to 

4 consumer) may also be extended to include other variations including consumer to 

5 business or consumer to consumer. An example of the former could be an employee who 

6 is slated for lay-off or firing within a large organization. The indirect effect of the lay-off 

7 could result in a negative impact upon the individual's financial welfare and indirectly 

8 upon the professional contributions of that employee's children. The likely potential 

9 beneficiaries (corporations) may identify potential lost revenue streams and effectively 

1 0 bribe the present organization with "value" to avoid the lay off. It is also noteworthy to 

1 1 mention that the justification for accepting the offer for the second organization is more 

12 compelling if there exists a "trickle down" economic impact, ultimately upon the first 

1 3 organization (and certainly SDI, within the present framework, would factor such small 

14 economic negative factors into the optimization model for the most efficient supply chain 

1 5 of vendor relationships). 

16 Within the scope of the present example, it is possible to create a form of 

17 insurance in which SDI identifies extremely similar circumstances where similarity is 

1 8 measured primarily by the nature and scope of the opportunity and its associated risks as 

19 well as the entity whose critical action is necessary to bring about the desired objective. 

20 In this case, the individuals (or entities) benefiting from the desired action are able to 

21 effectively conceive to contribute a certain level of value to the collective group to 

22 effectively insure the desirable outcome on behalf of all of the members of that group, 

23 such that, if the desirable action on behalf of the other entity (in this case the large 

24 organization contemplating substantial layoffs) does not concede to the desired action for 

25 one or more of the individuals, this value contributed from the group is used to apply still 

26 greater economic pressure upon the entity to perform the desired action. If this does not 

27 facilitate the action, the value is instead applied in another (albeit less optimal) form to 

28 compensate at least a portion of negative economic impact upon that individual s/entity of 

29 course, the level of risk may influence the amount of insurance each individual/entity is 

30 willing to contribute within a given group. 
31 
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1 1. There are numerous other extensions and potentially practical applications of the 

2 present novel methodology, particularly those applied primarily at the consumer 

3 level, and where consumers are provided with an infomediary such as SDI to 

4 represent the most individually beneficial actions within the present agent mediated 

5 contract negotiation framework. One of the significant technical advances of SDI is 

6 the ability to not only represent the particular best interests of an individual (or other 

7 entity but also provide; (1) knowledgeable experts (or "knowledge proxies") who 

8 within the information rich economic and political universe is able to off load the task 

9 of acquiring constantly updated and changing and expanding expert knowledge 

10 within each domain and sector of the economic and political systems. In this regard 

1 1 SDI may, as a service to consumers, represent consumers according to which overall 

12 economic scenario which best represents their socio-economic interests which may 

13 factor in both long term or short term monetary benefits and/or socio-economic 

14 benefits and/or (if they differ in any way from above) personal or political views or 

15 objectives. The use of collaborative filtering may provide a useful tool for assuring 

1 6 that proper recommendations for political views reflects economic political and socio- 

17 political information available about the user, such economic data of which is 

1 8 typically gleaned passively and may be confirmed by active confirmation of the user 

19 along with political views and positions on political issues which may prompt the 

20 user in order to collect rapid profiling information about the user is overall political 

21 objectives. Thus SDI can effectively in practical terms provide a "continual" 

22 personal advisor to guide all consumer activities (via contracts) as political decision 

23 making policies, and even these consumer actions to indirectly (via the supply chain) 

24 apply political pressure upon organizations in order to achieve certain desired 

25 economic, or even politically advantageous objectives, at an individual level. 

26 Accordingly, once they are certified on an individual level, SDI could provide 

27 synergistic benefit to the ultimate political objectives of each person individually by 

28 creating a "global political support strategy" for each group of individuals sharing the 

29 same political objectives. SDI may develop information models based on relevant 

30 historical data of what political and economic impacts resulted from politicians and 

31 political camps and combinations thereof within political and economic conditions 



Page 228 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 which are similar. Within the framework of this methodology an extremely valuable 

2 technical advantage of SDI is its capability to monitor huge volumes of information 

3 from not only organizations but also news feeds, legal cases, government regulation 

4 over industry and profession (which may indirectly affect quality of life of 

5 individuals), and/or monetary status of individuals. Case in point, within the health 

6 care industry there is a significant amount of covert, however, aggregious attempts by 

7 both government and private sector HMO's to limit the quality, scope and depth of 

8 health care services which can be provided to the consumer. In accordance with the 

9 present SDI framework, an SDI agent acting on behalf of each individual could 

10 effectively create an economic and political coalition with the collective economic 

1 1 ability to boycott the most aggregious violators of their monetary and health care 

1 2 benefits, apply political pressure through believable, perhaps even irrefutable, threats 

13 (through individual contractual obligations with SDI) to mandate changes to those 

14 particular policies (be at the governmental, regulatory or legal levels) so as to 

15 bring about the desired results at the consequence of voting the relevant decision 

16 makers out of office (directly or vis-a-vie their political appointees). SDI, in 

17 accordance with its functional objectives could also certainly supply valuable 

1 8 information to users supporting its internal rationale for structuring such coalitions by 

19 estimating and presenting to the user such data as the predicted degree of reduction in 

20 the quality of health care (from physicians and hospital for that user and her family 

21 based upon the negative impact which the present system has upon these entities. In 

22 addition, it may be advantageous for the most aggregious abusers to be boycotted by 

23 individuals collectively. Based upon the knowledge available to SDI, it may be 

24 advantageous to pre-commit or provide a polling-based solution for the individuals 

25 constituting these coalitions as to their responses to the desirable actions prescribed if 

26 the user rejects the recommendation, SDI may further query the user and/or suggest 

27 an alternate strategy (suggest one to SDI). This data about the political issue 

28 strategy(s) which the user support may then be used in order to provide direct 

29 feedback to politicians with sufficient notice with which to pre-formulate their 

30 political strategies at both the micro and macro levels with which to best comply with 

3 1 the adopted or otherwise recommended political and/or economic edicts according to 
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1 SDI. Or, if it is determined that such pre-knowledge could be abused in such a way 

2 as to politically manipulate campaigns, large employers, etc. undermine the political 

3 support for these edicts, SDI may determine that it is advantageous to not disclose 

4 such information in advance to the target political groups (this, however, is a very 

5 subjective and complex analytical process). Thus SDI, representing each common 

6 group, must determine and predict which politicians are most likely to ultimately 

7 support the issues which are most important to their political objectives for which 

8 advanced disclosure of their own political support strategy is likely to advance the 

9 strategic knowledge of that politician while assuring that none of their positions on 

10 none of the political issues are likely to be ultimately abused by that politician by 

1 1 him/her ultimately changing his/her position on that issue in order to improve his/her 

12 chances of election. For this reason, it is also of interest to each group to attempt to 

1 3 predict the ultimate position that each other group is likely to assure for each political 

14 issue in order to determine whether or not pooling their collective influence on that 

15 particular issue (including free disclosure to those "trusted" candidates) would be 

1 6 advantageous or information which would be potentially harmful to pre-release to a 

17 potential adversary on that issue (which it may ultimately disclose to a candidate 

18 opposing that issue and/or modify its own political strategy in a manner which is 

19 ultimately antagonistic to the interests of the welfare of that issue or theoretically 

20 even issues which that group may wish to disclose in the interest of providing further 

21 support for the global political strategy through this form of coalition creation). It is 

22 worth noting that within the scope of the present framework it is likely that the 

23 availability of such detailed information, both predictively and at an individual level, 

24 regarding the ultimate political and economic effects which certain voting in other 

25 political support actions via SDI are likely to imply, it is extremely likely that 

26 members of each group may be reluctant to provide information, regarding the groups 

27 political strategy to public polls. 
28 

29 Thus it is a primary objective within the scope of the present system's formation 

30 of common political support strategy groups to ascertain each individuals user's ultimate 

3 1 political objectives based upon the analysis which provides analysis and prediction of the 
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1 present and likely future effects, both politically and economically from the standpoint of 

2 that individual (which may include social, professional and predicted individual investor 

3 oriented direct and indirect consequences at a detailed level as it relates to that individual. 

4 And secondly, to formulate a global political strategy based upon these objectives which 

5 are represented by an SDI agent and which is able to recommend actions and, as such, act 

6 as a coalition on behalf of the constituent individuals, and including negotiations for 

7 further coalition formation on various portions this global strategy in order to further 

8 enhances the collective power wherever common interests can be safely identified and 

9 shared to both groups' mutual best interest. 

10 The effects on commercial industry, even specific businesses in which certain 

1 1 actions are likely to result, may be modeled and presented to the user as well as the 

12 effects upon the user in light of the investment portfolio of the user. If a particular 

13 political position were to be taken, the system could even recommend 

14 ----- 

1 5 which is most compatible with a particular political position. 

16 

1 7 Continuation to Supply Chain Section 

18 It is also possible to utilize the extensive information in SDI to instruct each 

19 political support group's most advantageous supply chain structure in order to be able to 

20 recommend certain strategically advantageous cases of tax dollars which facilitate the 

21 construction of such supply chains in accordance with the above methods above 

22 described. It should be noted that a "political group" may be further subdivided into 

23 groups based upon purchase affinities (which themselves suggest different group 

24 divisions) which in turn may bribe or boycott commercial entities as needed to achieve 

25 their most advantageous strategy as consumers or simply individual users (as economic 

26 entities). Each subgroup (via SDI) may further augment their economic leverage with 

27 their political leverage (over tax allocation decisions). E.g., if candidate X supports this 

28 particular supply chain strategy and if he is elected then it is most optimally advantageous 

29 to also utilize consumer leverage in supporting transactions with certain vendors and 

30 boycotting others in order to support or complement the politically (tax) funded strategy. 
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1 As suggested, SDI may be used to facilitate the matching and strategic formation 

2 of coalitions of groups which are most likely to benefit one another on particular political 

3 issues or formation of strategic contractual relationships between commercial entities 

4 (SDI is able to determine if/when certain information is helpful or harmful to disclose to 

5 another particular entity). Each group may exert particular leverage over politicians. I.e., 

6 not only could the politicians receive feedback as to the political issues and economic 

7 (commercial) alliances which are most advantageous but also the groups could upon the 

8 advice and direction of SDI make their votes conditional upon the politician supporting 

9 certain issues or economic actions. SDI could provide real-time feedback right up until 

10 the election as to the numbers and percentages of users who had supported the candidate 

1 1 and the candidates response to them. There are other instances in which the leverage 

12 could be used to incentivize commercial entities to support certain political concerns e.g., 

1 3 if company X donates percentage of its revenues to the United Negro College Fund group 

14 Z will support the facilitation or an alliance between company and a major manufacturing 

15 firm (e.g., by putting consumer pressure upon those retailers which are the 

16 manufacturers' customers to in turn threaten temporary boycott of the firm unless the 

1 7 stipulations are met or if they are not consumers to instead offer consumer loyalty (e.g. 

1 8 through dedicated currency). 

19 A variety of applications of the present scheme are conceivable such as local 

20 politics such as commissioners office and board members delegated for purposes of 

21 masterplaning and use applications and allowing or disallowing certain real-estate and 

22 highway development projects. Certain commercial projects, for example, could be of 

23 substantial benefit to certain commercial entities which could provide considerable direct 

24 long term economic benefit to citizens in the local community even directly to the 

25 individuals in the communities (who elect those officials) as well as contribute indirectly 

26 via the available tax base. Or certain commercial entities could, for example, directly or 

27 indirectly provide economic benefit to certain individuals, e.g., if they are employees, 

28 investors or commercial entity which is in some way commercially dependent upon that 

29 company, e.g., for commercial business if it is a customer or alternatively a supplier and 

30 the impact of its resulting success in general (or even local presence) would again provide 

31 trickle-down economic benefits to that entity. In accordance with the present application 
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1 it may even be possible to estimate values for such factors as, for example, the economic 

2 impact 20 years in the future upon commercial business and industries in the local region 

3 (as well as nationally) if local schools are provided with a 10% vs. a 7 V%% share of the 

4 local tax base. With these predictive models, the associated prospectively affected 

5 commercial entities could, effectively encourage SDI acting on behalf of the local 

6 citizens to form a coalition which makes their voting of the relevant elected official(s) 

7 contingent upon the increased tax allocation for local schools (the specific preferred users 

8 could be predefined as well). SDI acting on behalf of the interests of the prospectively 

9 affected commercial entities could bribe the citizens with either (preferably) such benefits 

10 as in kind, goods/services which could in turn be leveraged through the supply chain or 

1 1 through a barter exchange system or employment benefits or wage increases (which again 

12 could be effectively treated as a barterable commodity subject to the above conditions of 

13 avoiding interactions with competitive entities. Such modelling could be extended to 

14 many other situations, e.g., allocation of funds to police, fire and EMS services, 

15 approving and providing highway improvement again may provide the above benefits 

16 etc. providing safety and preserving the health, welfare and life (in addition to 

1 7 educational quality) of a certain number of individuals (each with a certain predictively 

1 8 modeled) local economic impact. The cost versus impact affecting each citizen could be 

19 presented to them by SDI accordingly in order to elucidate their decision making 

20 processes. 

21 There may be certain instances in which users at an individual level may be able to 

22 provide predicted —this benefit can be quantified towards commercial 

23 entities or even individual users if particular events can be achieved (or particular events 

24 prevented). For example, consider a student who is determined to be capable towards 

25 achieving substantial real-world success. In one instance the student cannot afford tuition 

26 for graduate school. In another instance, the student cannot afford to continue his/her 

27 studies due to depleted financial resources. The present system may model and predict 

28 likely beneficiaries commercial or even individuals who are likely to benefit and 

29 accordingly "insure" certain long-term monetary benefits (if a commercial entities) or even 

30 personal or social benefits (if an individual). 
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1 Internet piracy of copyrighted information is costing authors billions of dollars per 

2 year. Technical experts (e.g., within SDI or outside "domain experts" which SDI had 

3 identified and presented the problem) identify a technical solution consisting of a ground 

4 breaking technology which could greatly reduce total piracy from occurring, however, such 

5 solution would require cooperation from all of the Internet server manufacturers. There is, 

6 of course, the small risk that in the relatively short-term the technology will partially or 

7 completely fail (e.g., copyright pirates find a way around the solution) before a sufficient 

8 amount of revenues are realized by the copyright owners to off-set the cost, and justify the 

9 expenses to the server manufacturers, of integrating the technology into their services. 

1 0 In addition, if copyright owners receive their rightful share of royalties, more Web 

1 1 servers will be sold in general to host such content. (Insert 2 from page 3) SDI can be a 

1 2 facilitator to enabling a beneficial solution to the industry in the following ways: 

13 1 Present estimated industry-wide revenue losses compared to the industry-wide 

14 the server manufacturers as a group sustain (based upon sales data each one 

1 5 individually and secretly submits to SDI). 

1 6 2 Also present the increased sales the industry would sustain if the copy-wright owners 

1 7 received their rightful royalties to the extent which the solution is able to provide. 

1 8 3 Provide insurance or create a future market to insure investors who would invest in the 

1 9 commercial opportunity for a share of the anticipated monetary benefits to the industry 

20 (such investors could include members of the industry). 

2 1 4 Negotiate a deal with investors and/or an insurer (the latter of which could also be SDI 

22 representing the industry). A SDI again, of course, may also represent the investors. 

23 Information which is enclosed may, for example, contain in addition to the identity 

24 of the prospective investors, the complete discussion of the economic benefits which each 

25 individual investor could potentially sustain, however, this information is disclosed only to 

26 the SDI representing the collective interests of the prospective investor coalition (a primary 

27 directive of which is to never compromise the data disclosure policies of any individual 

28 investment entity, even if such disclosure to one or more of members of the group may 

29 ostensibly benefit the collective group). It is thus the objective of this collective SDI agent 

30 to make individual date disclosure recommendations for each prospective investor that will 

31 optimize the probability that the basic framework of the collective cooperative group is 
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1 successfully created while enabling the individual entities to withheld sufficient strategic 

2 information from the group in order to optimize its own negotiating position with the other 

3 entities. For example, it would be particularly advantageous to disclose credentialed 

4 information regarding the benefits which the investor could provide to the collective group 

5 of investors (based upon the existing investor information disclosed by the associated 

6 companies) and information regarding the existing relationships which that investor has with 

7 existing companies which could, in turn, provide positive benefits to the companies 

8 constituting the global investment strategy of the group and (in the case of negotiating with 

9 each given individual investor prospect) the benefit to companies which that investor has a 

10 relationship with and made available to the present investor prospect. These companies 

1 1 may also include those which SDI has recommended to that investor as part of the global 

1 2 investment strategy for that group. 

13 User's reactions to various events and stimulate the video programming can be 

14 captured, aggregated and accessed by present viewers such video in real-time or 

1 5 asynchronously for future viewers of non-live content by user profile (or content profile) or 

1 6 by similar users to that of the profile of the user. Observing reactions/responses to political 

1 7 or ethnic jokes by that group which is targeted or other groups may be interesting to users to 

18 observe as a simple example. Reactions/responses to news or political events by different 

1 9 groups or those most effected thereby may be of interest to users. 

20 13.1.7 Off-line Variations 

21 We can allow the community dollar/iamworthit model to extend to physical 

22 transactions, for example via smart card technology. A buyer can carry a smart card with 

23 community dollars, that are loaded from the user's computer device, or from some third- 

24 party operated kiosk. 

25 For vendors who offer the integration of community dollars into their discount 

26 programs (e.g. may be members of a vendor coalition) the buyer may at any vendor 

27 terminal, kiosk or his/her PC upload targeted-vendor promotions such as digital coupons 

28 which are tailored to the users profile. This profile is downloaded to the vendor and which 

29 consists in part of the recent purchases s/he had recently made at the various vendor stores. 
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1 In another variation, the community dollars may be coded into a form which is bar 

2 code reader enabled and distributed to the buyer electronically or potentially if used in 

3 conjunction with a traditional loyalty points program, additionally printed for the buyer at 

4 the vendor's physical location (such as point of sale or kiosk), applied in conjunction with 

5 purchases at the vendors physical location. At which point typically a new coupon is 

6 reprinted containing the updated secure information pertaining to the buyer's community 

7 dollar and/or loyalty points account. 

8 In another variation, a promotion for a yearly allowance of community dollars could 

9 be printed as an advertising offer on a magazine coupon, newspaper insertion or direct mail 

10 piece which could contain a unique URL (typically the actual URL for the iamworthit 

1 1 community dollars subscription site with a unique post script as the character string ("code") 

12 identifying that particular vendor and/or that promotion) from which the buyer could 

13 subscribe to iamworthit, wherein the unique URL acts as an identifier for that particular 

14 vendor's promotional piece from which the buyer originally received the offer for his/her 

1 5 own community dollars promotion. 

16 Within SDI we could send physical solicitations to buyers, and allow buyers to 

1 7 access promotions pseudonymously. SDI can target a selected audience for each vendor. 

18 Example: An iamworthit card in accordance with the pseudonymous payment methods 

19 described above, such a card could be a direct extension of SDI into the off line 

20 environment. Buyers could use this card as an identifier such that when they travel 

21 physically from vendor to vendor, their profile data can be readily identified where data 

22 pertaining to their own behavior and policy (depending on their data release potentially part 

23 of the vendor's user profile data) is retrieved. 

24 If a smart card is used this user profile data may not have to be remotely retrieved 

25 but may be stored on local memory on the card itself along with the client-based pseudonym 

26 proxy server. In one novel variation, a card is done away with completely by virtue of 

27 revolutionary technological breakthroughs in being able to instantly and positively identify 

28 buyers biometrically using iris scanning techniques (which may in a variation be further 

29 combined with facial recognition techniques). Many vendors will wish to utilize user profile 

30 data in order to deliver promotions targeted discounts and promotions (see pending patent 

3 1 "System for Customized Prices and Promotions") 
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1 13.1.7.1 Location Enhanced SDI System 

2 The co-pending application entitled "Location Enhanced Information Architecture" 

3 (LEIA) describes an integrated advertising delivery platform which selectively targets buyer 

4 personalized advertising based upon both the buyer's personal profile and the present 

5 location of the buyer which may suggest appropriate ads from vendors which are local to the 

6 buyer, wherein user identifiers (UID's) which could include any of the above identification 

7 media provide the essential elements for this buyer targeting platform. With LEIA 

8 information providers can utilize location information, in addition to static and dynamic 

9 profiling information. The method customizes the information that is displayed on a private 

10 or public information device to the real audience in the vicinity of the device, instead of a 

1 1 predicted audience. LEIA collects an extremely detailed and comprehensive information set 

12 about the daily activities of a user, enabling enhancement of the user profile with location 

13 information and temporal activity patterns. The co-pending LEIA patent suggests 

14 appropriate application environments, for example in a smart home, an office, on a mobile 

15 shopping device, and in an automobile. A LEIA-based system stores personal information 

16 on users. 

1 7 We can extend LEIA by incorporation with the Secure Data Interchange system that 

1 8 we teach in this patent. SDI enables the user to receive the benefits of powerful and well- 

19 directed information, but within a system that respects his/her privacy requirements. The 

20 interchange acts as a secure data warehouse for users and information providers, enabling 

21 information providers to target users without revealing private information to the providers 

22 directly. As described in LEIA, the privacy architectures provided for in LEIA and SDI are 

23 all the more critical as components of an SDI enhanced LEIA system architecture given the 

24 extreme sensitivity of personal location data which must be securely protected in both real- 

25 time and within an asynchronous context. 

26 LEIA customizes information that is displayed to an information recipient based on 

27 object profiles and physical location of users. Presents the information most relevant to the 

28 REAL audience, not a predicted audience per se. Because of LEIA's ability to combine 

29 user's preference information with information about their current physical locations and 

30 deliver advertising which is both of personal and location contextual relevance to the user, 

31 LEIA provides a valuable enhancement to the targeted advertising services provided by 
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1 iamworthit. For example, instead of advertisers being limited to accessing desired 

2 preference attributes of users in the pseudonymous user database exclusively they may 

3 additionally access prospective target recipients by present (or anticipated future) location 

4 parameters of users connected to mobile and even terrestrial communications networks. 

5 In accordance with the iamworthit targeted advertising platform either advertisers 

6 select user profile attributes which they are interested in or the system matches their ads 

7 automatically to the most relevant users. The former application may be performed either 

8 within the context of the pseudonymous user database which the vendor queries or targeting 

9 rules which s/he provides which control the advertising and pages which are dynamically 

10 generated for each user based upon his/her user profile. Whether the user's location is 

1 1 detected on a mobile network or a terrestrial network including telephone or cable TV, 

12 LEIA's location features are able to provide additional useful features regarding users to the 

13 vendor's rules interface or to the pseudonymous user database controlling dynamic page 

1 4 generation at his/her site. 

1 5 For example, at a bookstore, we can recommend isles and particular books; at a 

16 supermarket, can play music preferences; smart-radio, play appropriate channels in a cab 

17 based on target object profiles (as meta-data). As suggested in issued patent "System for 

1 8 Broadcast of and access to Video and Other Data Using Customer Profiles" appearance of 

1 9 relevant selections can be continuously scanned for, dynamically selected and presented to 

20 the buyer in the form of "Virtual radio station". Such a system can also be linked to a 

2 1 service for making an instant purchase, or linked to a database (in conjunction with LEIA) to 

22 recommend where a buyer should physically go to make a purchase. For example, music 

23 selections that the buyer is presently listening to may be ordered. 

24 Also, it is possible to provide advertising which is targeted to a buyer by 

25 automatically recognizing pre-existing commercials and replacing them with targeted 

26 counterparts. This can be done through the identification of previously played commercials 

27 for example, commercials that have been manually identified and classified. Upon 

28 recognition, targeted commercials (including those which are targeted by buyer location in 

29 accordance with LEIA) may be inserted into these spots, and delivered and/or pre-cached 

30 through cellular, satellite or radio communications. 
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1 At a public phone we can identify a buyer with his/her calling card, and deliver 

2 targeted advertisements, via the public telephone readout or delivering the targeted ads as 

3 audio messages in which server software at the phone switch (an ISP level proxy) 

4 recognizes iffwhen the buyer is put on hold and delivers audio and/or audio/video 

5 advertising to the buyer accordingly. 

6 Targeted discounts and advertisements can be delivered at kiosks, for example using 

7 a credit card/smart card/other ID method (e.g. biometric.) Similarly, we can use credit 

8 cards to deliver targeted print advertisements on the backs of purchase receipts, e.g., 

9 supermarkets or fueling pumps or, alternatively, on a sheet dedicated for a advertising 

1 0 purposes conjunction with public copiers or printers or in another variation, on the cover 

1 1 sheet of incoming faxes which are sent to the buyer's fax machine or in which the buyer is 

12 otherwise identified automatically from the recipient's name field on the cover sheet. 

1 3 One application includes "Smart Home Intelligence", where methods are disclosed 

14 by which users' real-time behavior may be profiled through their movement throughout 

15 their home, and specific interactions with the various network enabled appliances 

16 throughout the home. Other inputs may include the user's speech patterns (using voice 

17 recognition in combination with text analysis). It could for example, note the user's speech 

18 content patterns in real-time. Such information provides invaluable clues as to the user's 

1 9 present activities, mood and interest state and may be processed by the presently described 

20 algorithms tuned with location/time features typically using the assistance of human data 

2 1 analyst to identify the key features and correlations. (This information may also provide 

22 enhanced information pertinent to the user's general, static preferences as well). 

23 Other extensions of this scheme are also considered e.g. within the context of the 

24 user's office, or automobile and pedestrian activities. This application may thus extended 

25 the usefulness of the iamworthit model to advertisers in being able to target users through 

26 the presently anticipated on-line media as well as networked appliances and in either case, 

27 based upon the relevant context of users' present activities and behavior (and from this 

28 potentially their inferred moods or mental states) within their homes and elsewhere. 

29 Additionally (perhaps most importantly), LEIA provides a highly beneficial value to 

30 vendors whose customers purchase primarily from the vendor's bricks and mortal store 

3 1 presence. SDI can identify multiple vendors which share a common physical location. 
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1 Additionally, LEIA can also identify the physical location of buyers on mobile and 

2 terrestrial networks and suggest the sharing of off-line prospects for these off-line retailers 

3 on the basis of not only interest but physical location. 

4 1 3.1 .7.2 Digital Set Top Box Methods 

5 Similarly, we can deliver targeted advertising and other information through cable 

6 TV systems, as described in the issued parent patent application entitled "System and 

7 Method for Scheduling Broadcast of and Access to Video Programs and Other Data Using 

8 Customer Profiles" US Patent # 5,758,257, and co-pending application entitled "Broadcast 

9 & System for reduced memory terminals broadly address the use of cable systems as an 

10 interactive medium (in a bi-directional network architecture) for purposes of delivering 

1 1 targeted advertising targeted advertising and other information to the consumer based on 

12 user profiles". 

13 In this system customer behavioral data is collected at the digital set top and the 

14 upstream channel enables these profiles to be processed at the lead end server. These 

1 5 detailed profiles may then be subsequently transmitted down and stored at the level of the 

1 6 individual set top. The cable environment is a two way interactive medium. The bandwidth 

17 allocation is inherently asymmetric. Separate channels can push parallel adverts, which are 

1 8 selected at the set-top-box according to a buyer's profile. Each channel can have associated 

19 meta-data to allow matching at the set-top-box. As an alternative variation, full motion 

20 advertisements may be down loaded in the form of applets to the digital set top box and 

21 displayed to the buyer in similar fashion as described above. Or full-size commercials or 

22 infomercials could even be downloaded and inserted in place of existing commercials on the 

23 video stream. Digital tags used for queuing for ad insertion technology enable this 

24 capability. The preferred commercial implementation of the system is used within the 

25 context iamworthit (the buyer infomediary service in which the viewer receives value for 

26 advertising received). This method is preferred (in as much as in the following alternative, 

27 the cable operator does not need to be in the loop). With that said, in an alternative 

28 embodiment, however, requiring the cooperation of the cable operator various commercials 

29 could be broadcast in parallel during commercial breaks. The user profile at the set-top is 

30 deployed to select the most appropriate commercial accordingly. 
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1 User profiles regarding viewing behavior can be collected with relatively minor up- 

2 stream transmission, e.g., to infer whether a buyer is viewing a program the set-top may 

3 transmit the viewers current viewing selection, e.g., two minutes after the beginning of each 

4 half hour and two minutes before the end of the half hour (and possibly at additional 

5 intervals during the viewing segment as well). In the point-to-point access and delivery of 

6 personal information as well as the passive monitoring of viewing behavior (for profile 

7 generation at the head-end and subsequent user profile delivery for the set-top) because of 

8 strict buyer privacy regulations in the cable/satellite industries the use of the pseudonym 

9 proxy server architecture (as described in the parent case and integrated into the above 

1 0 referenced case) is extremely important. 

1 1 Once interactive television is enabled by access to very large amounts of bandwidth, 

1 2 it will be very feasible from a bandwidth utilization standpoint to transmit more interactive 

1 3 content. Predictive caching is still extremely advantageous in as much as advertising (unlike 

14 interactive video real-time on demand video content) tends to be controlled by targeting 

1 5 rules of the advertiser rather than the user who receives value in exchange. Of course, radio 

1 6 or music content may also be delivered in conjunction with preloaded audio commercials. 

1 7 And any of the above content is deliverable over the same networks to other appliances such 

18 astheP.C. 

19 13.1.7.3 Optical-Based System Extension 

20 Iamworthit in a future version of the system could incorporate a novel optically 

21 based medium for delivery of the advertising. The protocol would enable the delivery of 

22 user targeted advertising in a non-electronic environment directly to the user from a static 

23 source in visible contact of the user. The system incorporates the following aspects: The 

24 user wears glasses or contact lenses which contain a Polaroid film in which the optical 

25 medium is polarized for only certain very narrow and specific wavelengths. 

26 Advertisements (or other messages) may be presented from signboards which display from 

27 the same surface ads appropriate for every different user profile segment for which a unique 

28 message is appropriate. The Polaroid film for each user is unique to the user segment to 

29 which they belong. Each user segment's optical film filters out all of the particular 

30 wavelengths of all other messages except that which is appropriate to the user segment to 
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1 which that user belongs. Alternatively, the current application may also be relevant to 

2 virtual tags as users may provide instead optical messages specifically targeted by a user to 

3 other users characterized by their profile features which establishes the entire criteria for the 

4 user segment they fall into for which an appropriate message may be targeted. 

5 13.2. Interactive Data Mining Applications 

6 In this section we describe interactive data mining applications, which involve 

7 dynamic two- or multi-way communication between agents. For example, within the 

8 context of a vendor-consumer interaction, the key difference is that information flow is 

9 not one-directional. An important example is time-of-purchase, in which the consumer's 

10 agent initiates a request for a response (with a counter bid) from a number of vendors. 

1 1 The vendors can access profile information about that consumer, and then make 

12 personalized responses. In general, this bid-response processes can iterate, with 

13 vendors/consumers exchanging information over multiple rounds. The exchange of 

14 information may occur within SDI in initial stages, with the system acting as a trusted 

1 5 intermediary until the agents are ready for an introduction. 

16 We describe time-of-purchase competition, and then move on to general match- 

1 7 making applications, which allow interested parties to exchange information by mutual 

18 consent (as indicated in SDI rules), with initial information exchange autonomous. There 

19 are a number of interesting match-making applications, for example within a system for 

20 sensitive negotiation, introductions. The system of SDI can play the role of a trusted 

2 1 intermediary, so that only suitable introductions are made. A vendor can place a request 

22 for a certain type of meeting without its competitors knowing that it has made such a 

23 request. Finally, we describe a "real-time experts market", in which experts can respond 

24 for payment to questions placed by other agents. The system of SDI allows a useful 

25 matching of agents and experts, and also allows data mining to check on the feedback a 

26 particular expert has received from previous clients. We also suggest an application of 

27 interactive SDI-based data-mining to resale markets, and to a decentralized transportation 

28 management system. 
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1 13.2.1 Time of Purchase Competition 

2 One application of the system of Secure Data Interchange is in "time of purchase" 

3 competition, where by a networked system of vendors can subscribe and receive the 

4 opportunity to place counteroffers to users that are about to make a purchase. This 

5 application is enabled within SDI because the user agent can remain anonymous while 

6 receiving counteroffers, but still use the system of SDI to provide profile information that 

7 can allow vendors to make attractive personalized offers. The client-side SDI proxy notifies 

8 a central SDI '"time of purchase" server, which can: 

9 Determine appropriate vendors to prompt with an opportunity to make a counteroffer. 

10 Create a one-time anonymous identifier for the user, linked to an appropriate set of 

1 1 profile information, and allow vendors to execute queries under that profile. 

1 2 Collect counteroffers, and pass them to the user's client machine. 
13 

14 We also describe how to generate coalitions dynamically, based on the privileged 

15 position of the time-of-purchase server in the marketplace. The time-of-purchase server can 

16 transparently bundle purchases from disparate end-users, and negotiate quantity discounts 

1 7 with vendors. Buyer Coalition formation can significantly improve negotiating leverage as 

1 8 the collective buying power of numerous individuals using the time-of-purchase competition 

1 9 feature, and be made completely transparent to users. 

20 Time-of-purchase allows a user to elicit dynamic market competition between 

21 vendors. For example, when a user browses a book at Amazon.com if she/he chooses to 

22 activate the time-of-purchase competition feature from the browser, a notification will be 

23 sent-out in real-time to all competing book stores which also offer that product section. The 

24 notification contains the product code and price quoted to that user at Amazon. The 

25 electronic agent at the vendor's site has an opportunity to counter Amazon's offer to the user 

26 (typically on the basis of price, however, other parameters may be used as well). The user 

27 may then select the offer which is most competitive or attractive, including the original 

28 offer. 

29 Integration into a standard shop-bot interface e.g. with comparative features of the 

30 products associated with the various offers across a variety of product criteria, or at the least 

3 1 price can be used to provide the user within simple recommendations. 
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1 Another extension is to propose a special interest-bearing bank account which we set-up for 

2 the user which is tied into a wallet or debit/credit card for off-line transactions. We can 

3 allow a user agent to bundle its own purchases over time, and use the system of SDI to 

4 guarantee future payment to on-line vendors in return to a good current offer. This is novel, 

5 because it is like participating in a "discount scheme" without purchasing the right to future 

6 discounts up front. A user can still invest money that is earmarked for future purchases with 

7 a vendor. 

8 13.2.1.2 Description 

9 Figure 17 illustrates the overall architecture of this on-line B2C e-commerce 

1 0 application of the methods of SDI. The agent represents a user with an SDI-enabled client 

1 1 machine, the vendor a server computer, also SDI-enabled. Agents interact with vendors both 

12 directly and via the central SDI server, and other components. We illustrate the time-of- 

1 3 purchase server, which receives requests from user's client machines as purchases are about 

14 to be made, and forwards those requests via the SDI central data warehouse to vendors. 

1 5 Client-side proxy agents monitor the interaction between a user agent a vendor, and can be 

1 6 configured to automatically notify other vendors when a purchase is about to take place. The 

17 key problem that we address to implement a system for time-of-purchase competition is 

1 8 how to identify the type of purchase that a user is about to make, automatically. This makes 

19 the operation of time-of-purchase seamless to a user, the system simply requires that the 

20 user waits for a period of time while competitive bids are collected from other vendors. 

2 1 We propose two variations: 

22 a) The vendor with which the user is about to make a purchase subscribes to SDI 

23 and provides information to the client-side proxy about the user's interaction. 

24 b) The vendor is un-cooperative, and the client-side SDI proxy needs to use other 

25 techniques to classify the type of product or service that a user is requesting from 

26 a vendor. 
27 

28 The central SDI data warehouse maintains a searchable index of vendors that 

29 provide certain products and services, and sending buyer-profile and purchase information 

30 to appropriate vendors just before a purchase is made. Vendors can register with SDI to 
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1 receive time-of-purchase competition opportunities, and can also register to allow time-of- 

2 purchase competition with purchases on their web pages (case a, above). Of course, it is 

3 unlikely that a vendor would do this without incentive, perhaps one incentive could be a 

4 discounted price for registering to receive opportunities for time-of-purchase competition. 

5 In fact, this problem of client-side identification of transactions is a key problem that 

6 must be addressed in a description of client-side user profiling and the submission of 

7 information to SDL The system of SDI requires a method to know what the user is doing. 

8 Click streams, typed query information, provided profile information etc. 

9 A vendor that registers to receive time-of-purchase competition opportunities agrees 

10 on a standard classification system for products and services, and to state what types of 

11 products and services it wishes to enable for time-of-purchase competition. The 

12 classification system encodes the product or service that a user requests, at an appropriate 

1 3 level of detail to allow other vendors to make reasonable counteroffers. 

14 The iamworthit central server maintains a database of vendor interest sets, so that it 

15 can determine from a classification code which vendors will be interested to provide 

1 6 counteroffers. 

1 7 For example, if the product is "flights to the UK" then the code might encode the 

1 8 dates that the user wants to travel, and his/her preferred departure and arrival airports. 

19 Take case (a) , where we assume that a vendor is cooperative and provides the SDI proxy 

20 agent that implements time-of-purchase for the user with a final product/service code and 

21 price. The system works as follows: 

22 1 . User enters into a dialogue with vendor. 

23 2. Vendor makes "final offer" to a user, and the user is about to make a purchase. 

24 The vendor has also informed the user of the product/service classification code 

25 that encodes relevant information about the type of product the user wants to 

26 purchase. 

27 3. User clicks on 'time of purchase competition option' on its local SDI-enabled 

28 client. 

29 4. Client sends message to the SDI time-of-purchase competition server with (User 

30 profile information/identifier, one-time time-of-purchase ID, product/service 

3 1 classification code) 
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1 5. Time-of-purchase competition server looks up the product code, and sends the 

2 user profile identifiers and a time-of-purchase ID to relevant vendors. 

3 6. Vendors can construct competitive offers based on profile information accessed in 

4 the central SDI data warehouse, and information about the existing offer. 

5 7. Vendors send the counteroffers to the time-of-purchase server, which forwards 

6 the offers to the user's client-side SDI proxy. 

7 8. The user is presented with a set of alternatives, and can make a final purchase 

8 decision. 

9 9. The final purchase is completed with the appropriate vendor. 
10 

1 1 We allow other vendors to access profile information about a user based on the 

1 2 profile information that the user provides to time-of-purchase, i.e. based on the profile which 

1 3 it wishes to use for the purposes of having appropriate counteroffers constructed. At this 

14 stage the client-side SDI proxy agent might also send additional profile information to be 

1 5 stored within the SDI data warehouse. 

16 It is useful for vendors to access profile information about a user (anonymously or 

17 pseudonymously, as defined by the user's proxy agent) because products and services can 

1 8 be configurable goods or services, and a vendor can compete on more than price alone, but 

1 9 also in other dimensions. 

20 The one-time time-of-purchase ID is constructed client-side in step (4) to act as a 

21 temporary identifier for the purposes of soliciting competitive responses from vendors. 

22 Vendors can be prevented from making direct offers to users, all counteroffers must be 

23 forwarded though the time-of-purchase server. 

24 Figure 18 shows the offer process in time-of-purchase competition, with an offer 

25 from a vendor that a user interacts with via an SDI-enabled client triggering a set of new 

26 requests for counteroffers from other vendors. The 'offer' message from Vendor 1 includes 

27 a code about what type of product or service the user is requesting, and is passed to the other 

28 vendors. The 'request counteroffer' from the client to the iamworthit server contains 

29 information from the user to enable the other vendors to make personalized responses, based 

30 on profile information stored in the central SDI data warehouse. After the 'request for 

3 1 offers' are made, the vendors compute responses and (optionally) access profile information. 
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1 Finally, vendors make responses which are collected by the time-of-purchase competition 

2 server and forwarded to the user client. 

3 We can also allow vendors to pop up competitive offers to other offers the user 

4 sees on another competitor's site or ad banner in real time, even if the user is not about to 

5 purchase the product. For example, if a user is shown "get this CD for $15, then a 

6 competing offer of no, get the CD for $14 can pop-up". A simple extension simply has 

7 the user also receiving a pop-up banner with a competing price if they are viewing a 

8 competitors offers in the virtual store of that competitor. We might also report to a vendor 

9 if, when, and to what degree a customer is becoming disloyal to a particular vendor and 

1 0 taking alternate offers. 

11 13.2.1.3 Applications 

12 There are numerous example commercial applications in which the time of 

13 purchase competition features can be utilized to provide significant benefit to the user. 

14 Some the less obvious commercial domains include: 

15 1). Travel Packages and Flight and Hotel Rates - Features, amenities, class, activities, 

16 entertainment, excursions, etc. 

17 2). Financial Services (e. g. , Loans and Mortgages) - Terms, amortization, interest 

1 8 rates, service fees, etc. 

19 3). Automobiles - Luxury and safety features, warrantees, auto club membership, 

20 terms of purchase, etc. 

21 4). B2B Commerce - Numerous commercial applications including standard Web- 

22 based environments for B2B e-commerce including buyer-side and seller-side e- 

23 commerce applications, vertical B2B portals and trading floors. 

24 5). Job Positions - Employers posting job positions may be notified by the service of 

25 competing job offers which employment candidates are browsing ( and ultimately 

26 accept if this information is indicated by the candidate). Resumes and, if desired, 

27 other profile information is provided to all competitors. Depending upon the profile 

28 of the candidate and/or competitive employment opportunities which a particular 

29 candidate is observing (or has recently observed), employers may wish to customize 

30 offers dynamically on an individual basis. 
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1 13.2.1.4 Integration into a Shopbot Interface 

2 We also suggest the integration of time-of-purchase competition into a standard 

3 shop-bot interface e.g. with comparative features of the products associated with the 

4 various offers across a variety of product criteria, or at the least price can be used to 

5 provide the user within simple recommendations. With this, even without a time-of- 

6 purchase offer the user can at least compare its offer with fixed-price deals in the wider 

7 market place. We might also provide a user with historical information about previous 

8 (low) price offers, within a Yahoo-like portal for e-commerce; to provide a user within 

9 information about a reasonable price for each individual purchasable; i.e. the very lowest 

10 price that each given item had been offered to a customer previously using time of 

1 1 purchase competition. This site could also act as a stand alone proxy server (like 

12 anonymizer) which inserts this information by overlaying it on Web pages as the user 

1 3 browses the web. 

14 The time of purchase competition feature is designed to be accessible to a user 

1 5 within the context of or most any information access mode relating to browsing on the 

16 Internet e.g. Web or portal browsing, receiving email or "push" content, submitting 

17 queries (e.g. specific names, categories and/or desired features (or combinations of the 

1 8 above) of products which the user identified as being or present interest. 

19 During any of the above information access modes when a buyer specifies time-of- 

20 purchase competition at her SDI client proxy, SDI can automatically provide competitors 

21 with information about a buyer's product or service requirements, and a buyer's profile, as 

22 well as on particular vendor specific offers the buyer is observing or contemplating before a 

23 buyer makes a purchase. This can facilitate competition between vendors, and can lead to 

24 better prices and offers for buyers. The system of time-of-purchase competition can also 

25 help to reduce the costs of entry into a market, because name-recognition becomes less 

26 important. New vendors can simply register with time-of-purchase referral system, and 

27 cherry pick a small set of purchases. 

28 This is a next-generation e-commerce service. Current shop-bots, for example 

29 "Junglee" at Amazon.com and www.shoptheweb.amazon.com provide a static comparison 

30 shopping service. A buyer can specify a product, and receive price information about the 

3 1 product from different suppliers. There is no dynamic competition between vendors on price 
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1 or features. The buyer driven service for flights offered by www.priceline.com is more 

2 dynamic, in that a seller is found to match the price that a buyer bids, but does not 

3 necessarily promote competition between sellers for a user's purchase. In fact the sellers can 

4 make excess profits from the pricing errors made by buyers, because once a buyer states 

5 his/her reservation price, that is the price he/she must pay. Surplus goes to the market 

6 operator, e.g. priceline, but not to the user. The system is also different to our proposed 

7 system in that: 

8 a) Time-of-purchase competition is disintermediated and decentralized, 

9 implemented dynamically over a virtual network of competing vendors. 

10 b) Time-of-purchase also allows a vendor to access anonymous profile information 

1 1 about a buyer, to enable a careful tailoring of an offer to the preferences of a user. 
12 

13 We can use profile information, and historical transaction information for similar 

14 transactions, together with the customer price/promotion algorithm disclosed in co-pending 

1 5 patent "System for the Automatic Determination of Customized Prices and Promotions" to 

16 negotiate on a deal with a vendor that will optimize the value to the buyer. Profiling of 

1 7 vendors, and buyer transactions, can allow buyers to avoid making bids that are too high and 

1 8 losing value (airlines in priceline.com can profit from inaccurate buyer bids). 

19 Time-of-purchase enables vendors with competing products or services to receive 

20 automatic notification when a buyer is about to purchase a relevant product or service. A 

21 vendor can also receive information on the profile of a buyer, and the offers made by other 

22 vendors; and submit counter-offers to a buyer via the buyer's SDI-enabled client. The buyer 

23 can then be presented with a final set of offers, before making a purchase decision. 

24 A buyer can also configure its profile management rules within time-of-purchase to 

25 provide profile information relating to the buyer's sensitivity to discount offers, customer 

26 loyalty with other vendors, value responsiveness (bargain driven), responsiveness to high 

27 quantity discounts (for only those categories which the buyer makes frequent or large 

28 purchases), etc. This information can be certified by the buyer's client-side SDI proxy, as a 

29 fair representation of the purchasing habits of a buyer, and can be aggregated across product 

30 types to protect a buyer's privacy. 
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1 The SDI time-of-purchase server can identify vendors with similar products or 

2 services, either using a static index which maintains vendors in particular product domains, 

3 or through dynamic profile matches between the target object profile of the web site that the 

4 buyer is currently browsing and target object profiles of the web sites of other SDI-enabled 

5 vendors. Alternatively, perhaps classification and clustering techniques can be deployed to 

6 identify similarity between vendors at the level of target objects, i.e. the products that a 

7 vendor offers; as indicated by virtual tags provided by a vendor that profile its products and 

8 services. 

9 The time-of-purchase competition system may also draw in static "listed" offers on a 

10 vendor's web site, to prevent a vendor over-charging a user. In addition, SDI might track 

1 1 purchases and via clustering techniques request prices on similar or competing products to 

12 the product that a user is about to purchase. These products and/or services may also be 

13 released to a user. When presenting a choice of products and/or purchases to a user we can 

1 4 allow many features to be summarized, for example features which relate to the location of a 

1 5 vendor, terms of shipment and insurance, and user ratings. 

16 Vendors are notified, and provided with the ability to access the profile of the buyer 

17 (as the buyer deems appropriate), either with client-level processing or through the release 

18 of an anonymous profile to the vendor. Vendors typically will wish to construct offers 

19 through a rule-based engine, data-mining techniques, or automatic collaborative filtering 

20 techniques, as disclosed in co-pending patent application "System for Automatic 

21 Determination of Customized Prices and Promotions" and U.S. Patent #5,754,939, 

22 "System for Generation of User profiles for a System for Customized Electronic 

23 Identification of Desirable Objects" as such techniques may be deployed by the vendor 

24 directly or via the Secure Data Interchange representing the interests of the vendors. 
25 

26 Choosing an Offer to Make 

27 User profile information may include a temporal profile of the buyer's present 

28 activities, including search terms, recent page navigations, what pages is the buyer 

29 observing presently (and the profile of this page) or even his/her present physical location as 

30 well as the general user profile. Any portion of the above particularly the latter two may of 
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1 course be withheld from the vendor). Or the buyer may be more generous in providing this 

2 information about the various static or dynamic features. 

3 Vendors can target buyers on the basis of their preferences and interests, and also 

4 within the temporal context of when they are most likely to be receptive to offers (thus 

5 providing also a benefit/service to the buyer as well as enabling the competing vendors to 

6 increase the price point at which their competition begins, knowing that the buyer is in an 

7 optimally receptive condition to accept offers for that item). For example, a user might 

8 release to a vendor the current context of a user's purchase, such as the click stream over the 

9 past 5 minutes. 

10 In the preferred implementation, vendors are also provided with a (client or web- 

1 1 based) rules interface which enables the vendors to input pre-stated rules with which the 

1 2 system may solicit and respond to competitive offers automatically. In this way data mining 

1 3 may be performed in order for the vendor to determine what a user (or all users) by attribute, 

14 tend to best respond to by product, product feature, features or services of the vendor, price, 

1 5 etc. If pre-stated rules are used to automatically respond to a notification with a competitive 

16 offer, the nature and degree of discount is typically determined in accordance with the 

1 7 nature and degree of the original or previous offer and/or the user profile as disclosed by the 

1 8 client-level proxy/server to that vendor. In lieu of manually entered rules, co-pending patent 

19 application entitled "System for the Automatic Determination of Customized Prices and 

20 Promotions" another similar algorithmic methodology may be used as an aid by the vendor 

21 in order to automatically determine a competitive offer (or subsequent responses thereto) as 

22 well as an aid to the vendor in selecting optimal rules. 

23 These techniques can also be used such that predictive metrics as to selection, price 

24 and promotional type, may be optimally determined in relation to the individual buyer or 

25 specific relevant user profile attributes, for example, in accordance with a data analysis 

26 expert of the vendor (or representing the vendor via SDI) analyzing randomized versions of 

27 user profiles and randomized aggregate statistics. These metrics may be linked with actual 

28 behavioral statistics of the buyers on the vendors' web site. In order to further drive 

29 competition between vendors, the time-of-purchase system can provide each competitor 

30 details of the final transaction which the buyer chose to accept. Accordingly, in an attempt 
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1 to better improve their competitive advantage in the competitive bidding process a vendor 

2 may also wish to analyze a user's purchases and adjust its future offers to similar users. 
3 

4 Choosing a Profile to Release 

5 From the buyer's perspective, there is an elaborate methodology for the automatic 

6 determination of the particular buyer information which, if released, will elicit the most 

7 optimal offers from vendors which can be determined by utilizing that same type of pricing 

8 model which the vendor is likely to use (for example that same method deployed in co- 

9 pending patent entitled "System for the Automatic Determination of Customized Prices and 

1 0 Promotions" an instead be tailored to determine which metrics in the user's profile decrease 

1 1 the overall price point and/or increase overall value over an "average" user (with no profile 

12 disclosure). 

13 It is even conceivable that vendors could (upon user permission) be allowed to 

14 directly identify plus enter into a live discussion with the user during the time-of- 

1 5 purchase competition process. A variation of the system was even is further disclosed in 

16 the above co-pending application which was intended (according to the above referenced 

17 co-pending specification) as an electronic assistant to tele-marketers and other sales 

18 persons to determine offers and counter-offers which are automatically generated in 

19 response to (for example) rejections of the previous offer as well as counter offers by the 

20 buyer. This dynamic system was originally designed for salespersons to optimize the 

21 expected profit from each customer (in view of the general user profile and the offer 

22 buyer responses up to that point in the negotiation). As such, this technique could be 

23 readily extended to the current application in which the previous offers up to that point 

24 may instead originate from other vendors (instead of a single one), thus the system 

25 responses may be affected by the user profile as well as the offer response pairs up to that 

26 point in the negotiation process. 

27 It is likely that vendors will not compete on price alone, but rather through added- 

28 value services such as offering loyalty bonuses, cross-sells, and two-for-one offer and added 

29 features as well as service advantages such as fast delivery, guaranteed service/maintenance, 

30 warrantees etc. Vendors will choose this mode of selling to prevent simple price- 

3 1 comparison at the client. Conversely, in some instances vendors may attempt to eliminate 
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1 the features in order to create the perception of a better deal through marginal price 

2 reductions, then possibly add these features as counter offers to other vendors in which the 

3 same price may be maintained. 

4 Accordingly the above referenced algorithm as provided by the customized prices 

5 and promotions scheme, may offer valuable competitive insights to the vendor in being able 

6 to assemble more appealing cross-sells/up-sells and more complex offers (including 

7 multiple items which are adjusted by the system to be extremely desirable to the buyer) 

8 more effectively than the competition. Therefore the client will receive offers from multiple 

9 vendors, and after initial filtering of the large collection of offers, present a choice set to the 

1 0 buyer. 

1 1 SDI can also present a buyer with ratings or annotations for items across multiple 

12 competing products, for example, within the context of buyers whose attributes (which may 

13 be important pricing features) are similar to that of the buyer. SDI might also monitor 

14 previous offers made by vendors, so that if a better offer was made in the past a buyer can 

1 5 request the lower price, and try to bargain with the vendor. Typically the resulting purchase 

16 recommendations are presented as suggestions on the user interface to help the user's 

1 7 assessment of valuing the offer as a "good deal" as well as to accurately represent his/her 

18 own preferences. Some users may, however, use this feature as part of a fully automated 

19 (agent mediated) buying process which is performed in asynchronous fashion (as described 

20 below, this typically is only necessary if the time-of-purchase system is deployed in order to 

2 1 create buyer coalitions prior to negotiating with the vendors. Of course the time of purchase 

22 competition enabled buyer will distribute the offers in order to elicit vendor competition. 

23 SDI can also present a buyer with previous time of purchase vendor competition 

24 negotiations e.g. offers made in the past by vendors for similar products as those provided 

25 by that prospective advertiser (even from the same vendor) which are at least as competitive 

26 as offers made to that user or "similar" users. Because of the competitive nature of this 

27 environment, vendors will wish to leverage all of this information and using their own 

28 pricing algorithms and data mining tools, craft optimal offers through the pseudonymous 

29 buyer database. 

30 If the buyer so desires, the client-level proxy can also automatically notify these 

3 1 vendors if7when a particular offer is about to be accepted by the buyer. For example, a time 
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1 delay response in the client-level proxy actually processing the order requests could allow 

2 vendors a final opportunity to present another competitive offer to the buyer. In another less 

3 optimal variation, vendors are notified only upon the buyer agreeing to accept an initial offer 

4 received. A secure function within the buyers client residing within the Secure Sockets 

5 Layer (SSL) is able to authenticate to the vendor's receiving the notification that the offer is 

6 true and untampered by the buyer through the transmission of a secure encrypted message. 

7 As an additional service to buyers the SDI-level proxy server can perform analysis on the 

8 offers that a buyer receives, through comparison with offers that have been received by 

9 other buyers with the best offer that has been received by any buyer for the same product, 

1 0 and with the typical offer received by a buyer with a similar profile to the buyer. This can be 

11 useful to a buyer because it will allow the buyer to reject all offers if they are non- 
12 competitive. The SDI-level proxy could also automatically identify for buyers the profile 

13 attributes that promote good offers, and the profile attributes that promote bad offers, as an 

14 informational service to enable buyers to gain better offers in the future, either through 

1 5 revealing certain information or changing behavior to attain favorable profiles. 

16 If iamworthit detects that discrepancies in the pricing for a given vendor's item 

1 7 occurs in correlation with certain identifiable user profile information (not arbitrarily), it will 

1 8 in the future initially disclose that data of any other which is determined from past buyer 

19 interactions to be advantageous for the buyer. Should the vendor exhibit price or offer 

20 discrimination (i.e. by having provided more favorable items to other previous buyers, the 

2 1 buyer may demand the lowest price. If the vendor ignores the request and the user has not 

22 received (or does not wish to accept) another vendor's offer for a lower price and the 

23 iamworthit infomediary service may apply pressure upon the vendor such as creating a 

24 buyer coalition and/or making a believable threat (as described below) which has the ability 

25 to deny future business to that vendor. If the vendor is able to present an exceptionally 

26 attractive offer the buyer may instead wish to stipulate that the offer is valid only if the time 

27 of purchase competition feature is not used by the user, iamworthit' s statistics based price 

28 prediction scheme can then predict if this decision would serve the user's best interests or 

29 not based upon the user's purchasing objectives. 

30 Furthermore, we can allow vendors to offer payment to a client in return for 

31 displaying an offer to the buyer, and vendors can also bid for space on the buyer's web 
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1 portal which is often represented as a profile associated with a pseudonym in conjunction 

2 with a description of the ad space. The purchasing decisions of the buyer may be performed 

3 by an electronic representative of the buyer's wishes (as "buyer agent") implementing the 

4 techniques of pricing/promotion selection algorithms completely autonomously on behalf of 

5 the buyer. However, the best offer can only be presented to a buyer to the extent that the SDI 

6 client level software understands a buyer's model of 'Value", and can make appropriate 

7 tradeoffs between product features and price (as implicitly inferred by the system through 

8 the above suggested techniques or explicitly stated by the buyer in advance). Nonetheless, 

9 this is a hard problem, and we expect that the buyer will often need to make a final product 

10 choice decision which could be a default in which the system would defer judgment to the 

1 1 buyer if it's statistical confidence as to the buyer wishes below a certain threshold. 

12 The collaborative filtering techniques described in pending patent "System for 

13 Automatic Determination of Customized Prices and Promotions", can allow a buyer's 

14 client-level proxy server, termed the buyer agent in this section, to automatically analyze 

1 5 offers. The system can also be used to send initial offers to vendors, on the basis of historical 

16 information about the transactions that have been performed between other buyers (which 

1 7 include the benefit of his/her complete user profile data) and the vendor. Offers can (of 

1 8 course) be sent to a vendor and its competitors. Finally, after offers that are received from 

19 vendors are pre-screened, they can be automatically ranked for value — using a combined 

20 quality and price metric (again judged within a collaborative filtering framework). Buyer 

21 feedback e.g. average ratings and annotations from previous buyers may form an additional 

22 criteria. The goal is to leverage the database of other offers that have been accepted by 

23 buyers in the past, and form a model of vendors, to determine whether or not a buyer has 

24 received good offers (i.e. we can exchange information within the system of Secure Data 

25 Interchange, and making more information available increases the efficiency of the market). 

26 Offers can be filtered and presented to a buyer in rank order. 

27 1 3.2.1 .5 Time-of-Purchase with Non-cooperative vendors 

28 We suggest a simple technique to allow a client-side SDI proxy to automatically 

29 detect that a purchase is about to be authorized, and the details of the purchase. The model is 

30 that SDI, or a third-party SDI client, provides a banking service for a user. The advantage of 
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1 such a service is that in making a purchase, we can require that the vendor provides 



2 


information to the bank about the purchase that is about to be made, as a condition for 


3 


validating payment. The system works as follows: 


4 


i . 


user interacts wun vendor, ana decides to mdKe d purcnase. 


5 


z. 


user provides payment metnou, lor example an oui DdnK dccount numoer. 


6 


-5 
J. 


i ne oUi Dame account server ciemanus lnioimdiion doout tne product wnicn is 


7 




aoout to oe purendseu oeiore dutnonzing pdyment ior tne service. 


8 


A 


i ne vencior provides tne oui udnK wiui liuoniidtion duout tne purendse. 


9 


5. 


The SDI time-of-purchase system can now step into the transaction, and 


10 




solicit competitive offers from other vendors. 


11 


6. 


The user is presented with the opportunity to select an alternative offer. 


12 


7. 


The SDI proxy- agent reports the user's final purchase decision to the 


13 




appropriate vendor, and denies purchase to the initial vendor if the user 


14 




accepts an alternative offer. 


15 
16 


8. 


The final transaction is executed. 



17 13.2.1.6 Automatic Buyer Coalition Generation 

1 8 The time of purchase competition system can also be used to allow the automatic 

19 formation of user coalitions. Coalitions can be generated dynamically based on the 

20 privileged position of the time-of-purchase server in the marketplace. The time-of-purchase 

21 server can transparently bundle purchases from disparate end-users, and negotiate quantity 

22 discounts with vendors. Buyer coalition formation can significantly improve negotiating 

23 leverage as the collective buying power of numerous individuals using the time-of-purchase 

24 competition feature, and be made completely transparent to users. The ability to offer a 

25 single vendor a number of sales allows significant negotiation leverage, and can be used to 

26 get special discounts. 

27 Although the coalition formation problem is very difficult in general, because 

28 individuals may not have the same goals and the computation of an optimal outcome with 

29 self-interested agents is hard (for example, how should the profits from a coalition be 

30 divided, how can we assess the value of each agent for the coalition, etc.) 
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1 The time-of-purchase server has information about the current and pending purchase 

2 requests from users in the system, and can make one-shot coalition formation decisions, 

3 with profits divided equally between agents. Agents provide a one-time bid for a service, as 

4 the offer that they are about to accept from a vendor. This is a guaranteed offer, as the 

5 purchase is about to be made. Therefore, it is a simple matter to aggregate bids from a 

6 number of agents as a bundle bid for a set of items, and negotiate a good price from a single 

7 vendor. This solution to coalition formation is simple, because it is not negotiated over many 

8 steps between agents, but performed one-time with agents tied into an ad-hoc coalition. 

9 For example, suppose that at any time, in the period of 10 seconds, 100 SDI agents 

10 are about to purchase compact discs, varying in price between $5.99 and $15.99. The time- 

1 1 of-purchase competition agent has this information, and can perform the following steps: 

12 1 . Collect time-of-purchase requests over a short period of time (e.g. 10 seconds). 

1 3 2. Identify purchases with a common type (e.g. compact discs). 

14 3. Aggregate the sum value of agent's buy prices for the purchases (e.g. sum the 

1 5 price that each agent is about to pay for a compact disk). 

16 4. Choose a discount on the TOTAL price, e.g. 10%. Make an aggregated buy offer 

17 to appropriate vendors on behalf of all clients (e.g. buy all 100 compact disks for 

18 a total 10% discount). 

19 5. If successful, divide the final discount over all clients, e.g. subtract 10% from 

20 every CD purchase. Perhaps take a share of the surplus as profit for the brokering 

21 role performed. 

22 The time-of-competition server can also take a position in the market, identifying 

23 bundled purchases over time, essentially aggregating demand and making offers of a 

24 guaranteed stream of purchases over a period of time based on historical information, again 

25 for a discount that can be seamlessly passed onto future purchasers. If the server is 

26 successful in taking a position in the market for some good that its user base seem to be 

27 interested in purchasing, then in addition to returning offers from vendors in the system, the 

28 time-of-purchase system can provide good prices to its clients. 

29 Buyers might also form coalitions on the basis of automatically detected synergies 

30 between their requests and predicted similarities between them. This can give buyers more 

3 1 leverage in negotiation with a vendor. In the automatic evaluation of how "good" an offer is 
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1 (how competitively it matches the market demand for that coalition's needs) in iamworthit's 

2 historical user information, it must also utilize information about the effect that size of the 

3 coalition has upon the market's responsiveness. Within the context of the present 

4 framework, the simplest version of the system involves creating the coalition around a 

5 particular item and, however, it also conceivable that it could be formed at the vendor level 

6 in which the coalition members are interested in any item(s) provided by the vendor and 

7 vendor competition is elicited by the coalition across all products which its members 

8 demonstrate interest in and which are provided by the competitive vendors. Given a 

9 sufficiently large subscriber base, buyer coalitions may be automatically created in real-time 

1 0 and in ad-hoc fashion (i.e. from the iamworthit buyers who are interested in a particular type 

1 1 of item at that particular time). 

12 Buyer coalitions typically are able to exert significantly greater pressure upon 

13 vendors the larger they are (this fact applies as well to multiple company b to b buyer 

14 coalitions). As such iamworthit may also notify other members which in accordance with 

1 5 their user profiles are likely to have an interest in a particular item (or items) in which a 

16 coalition of members have demonstrated interest in order to increase the size of the 

1 7 coalition. Similarly, the aggregate user profile of all members of the coalition may be used 

18 in conjunction with collaborative filtering in order to recommend those items which 

19 collectively best match the preferences of the coalition collectively and thus are able to 

20 provide the best deal for the most items possible to that particular coalition. Thus the 

21 coalition model is extended to placing competitive pressure upon vendors across multiple 

22 items, iamworthit must also utilize the above modeling techniques in order to establish 

23 pricing models based on the value of these different items in light of the total monetary 

24 value which the coalition represents to the vendor collectively. In addition to inferring 

25 buyer interest entirely passively, it is possible for buyers to manually edit their profile, e.g. 

26 by modifying or actively selecting categories of purchasables of interest which she/he 

27 would like to be notified of (e.g. via pager) itfwhen a buyer coalition for a purchasable of 

28 that type is being established. It is further possible to increase the size of buyer coalitions by 

29 allowing a certain amount of time for buyer's to respond to offers to join that particular 

30 coalition (e.g. 24 hours). Typically a couple of days is all the time that is required to achieve 

31 a substantially maximum buyer response. Also, because some vendors may wish to be 
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1 (automatically) informed iffwhen substantially large coalitions approach the vendor, there 

2 may be an additional marginal price advantage by providing an additional period for 

3 vendors to reply. 

4 The buyer may wish to allow the iamworthit time of purchase vendor competition 

5 system to operate in fully automatic mode whereby certain "acceptance parameters" are 

6 provided to enable the buyer agent to act fully or autonomously on behalf of the buyer's 

7 stated (or approved) parameters. This technique is likely to provide a central function 

8 particularly in most business-to-business iamworthit commercial applications. The 

9 business-to-business commercial domain may exemplify a couple of other novel features 

10 (which are not exclusively limited to business-to-business). For example, because 

11 replenishment of products or supplies suggest much greater repetition and thus 

12 predictability, it is sometimes useful to buyers to also leverage as part of the negotiating 

13 process an advanced commitment of buyer loyalty over an extended period of time. In the 

14 unlikely event of the buyer changing his/her vendor loyalty after considerable value has 

1 5 been provided to the buyer, the vendor may control the right to acquire certain assets of the 

16 buyer (e.g. corporate stock) as pre-agreed collateral for the vendor. Also, in the business- 

17 to-business domain iamworthit acts essentially as (or in conjunction with) a buyer-side 

1 8 portal to a web-wide vendor resource, though it could also be integrated as part of a seller 

1 9 side-portal interface tool as well. 

20 As an additional service to vendors SDI can provide enhanced profile information, 

21 aggregated from other vendors, to enable vendors to provide better focused offers than can 

22 be provided on the basis of the profile information directly associated with the pseudonym 

23 of a buyer. Certain portions of the user profile data that is unavailable for direct collection 

24 by the vendor (such as information that is collected on other sites including, in particular, 

25 competitive vendor sites) may reveal important information which enables the vendor to 

26 better target that buyer. As such the secure data interchange representing the collective 

27 buyers may aggregate, analyze and sell this data to the vendor so long as the release of such 

28 information does not negatively affect the predicted pricing or value levels for that buyer as 

29 performed by the above type of pricing algorithm. 

30 An interesting variation of this example involves the situation in which multiple 

31 entities represented by SDI users may be negotiating with the same other entity (or 
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1 individual). It may be for the same purchasable. Alternatively, it may involve different 

2 purchasables or the represented group may even be a combination of users and sellers who 

3 happen to be presently dealing with the same entity. Accordingly it is important for SDI to 

4 achieve an understanding as to the individual negotiating parameters as well as an 

5 assessment of a market demand model which characterizes the needs and objectives of that 

6 entity with regards to the particular prospective transactions being negotiated which SDI is 

7 mediating on behalf of each party which is captured through the main SDI server. As such 

8 it is often possible to thus attempt to predict the minimal acceptable terms of an offer which 

9 would meet those conditions. The negotiating leverage from all parties is based upon terms 

1 0 which affect multiple (or perhaps all) of the deals collectively. SDI thus has the power to 

1 1 significantly strengthen the negotiating leverage based upon a market demand model which 

12 characterizes that entity thus benefiting the entrusted parties on the whole. The statistical 

13 data used for market demand models from the user-centric or vendor-centric SDI is 

14 collected about the other parties which the parties are negotiating with. One useful source 

1 5 for estimating this predicted market demand curve which they represent. For example it is 

1 6 possible for end-users as a coalition of users to be automatically formed for this purpose. 

1 7 A user-centric SDI is then assigned to the coalition in order to (a) ascertain from user 

18 profile information (in addition to subsequent active query or recommended offer feed 

19 back/approval information about terms which the coalition would likely wish to receive 

20 from the vendor (subject to negotiation) (b) ascertain from vendor behavior relating to past 

21 users and coalitions terms which would be reasonable to expect from the vendor in order to 

22 anticipate reasonable offers/counter offers to present to the vendor. 

23 The user-centric SDI may even detect from the user profiles the possibility 

24 (followed-up by a confirmation query to the users) that the users may be willing to commit 

25 in advance for a specified period of time to certain customer performance criteria e.g. in 

26 exchange for not only guaranteed discounts but also an equity stake in the company which 

27 may increase over time and where continued ownership of which is contingent upon its 

28 sustained user performance as agreed upon in advance for the specified period of time. 

29 Assuming sufficient willingness, the coalition as directed by the user centric SDI will submit 

30 the recommended offer to the vendor along with its competitors using time of purchase 

3 1 competition. The coalition may even incorporate a believable threat (upon approval of its 
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1 members) (e.g., that it will not deal with the vendor in the future if the terms of its request 

2 (such as long-term discount privileges or conversely promise to give the vendor certain 

3 competitive advantages in future user coalition negotiations which may include future SDI 

4 coalitions. 

5 SDI's predictive tool suite can be used to predict the most likely prospects for a 

6 given coalition, i.e., matching users who have explicitly indicated an interest or have 

7 engaged in negotiations or past dealings with a given vendor. As indicated the 

8 recommended users may be targeted either with the same items or completely different 

9 items, the latter case in which SDI will try to negotiate a lower price based upon the 

10 aggregate sales form the coalition, not the volume of sales for one particular item per se, as 

11 in the first instance. In the present application, however, because negotiating pressure is 

12 being applied by a collection of entities upon (typically) one entity (or possibly alliance of 

1 3 entities), an SDI (which may typically be spawned from the main SDI service) representing 

14 the coalition is created in ad-hoc fashion. The other entity may accordingly wish to acquire 

1 5 its own SDI to also represent its commercial interests. Each SDI may represent multiple 

16 parties. No SDI must of course ever represent an entity which is negotiating with another 

1 7 entity which it represents. 

18 Coalitions are typically arranged automatically by SDI based upon observed 

19 synergies in user and seller activities which SDI observes. In fact the predictive capabilities 

20 of SDI could be used to model (1) the anticipated number of cooperating entities which are 

21 predicted to join the coalition and (2) from that information, the predicted terms and price 

22 which could be anticipated for the user coalition or (for the seller coalition). The anticipated 

23 price and volume of sales which could be anticipated if the entity were to accept SDI's 

24 invitation to participate in the coalition based upon the terms and conditions thereof as 

25 predicted to be most advantageous (collectively) according to SDI. 

26 13.2.1.7.1 Leveraging Control over a Vendor's Suppliers 

27 One of the roles of the iamworthit infomediary service could be extended to helping 

28 users not only identify vendors with whom they are most likely to do business with (vis-a- 

29 vie identifying the most desirable products/services for each user individually as the basis of 

30 the formation of the coalition), but also identifying of the other companies which do 
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1 business with the user recommended vendors which companies in particular may present a 

2 potential investment opportunity to the user. Or alternatively to determine which suppliers 

3 are potential candidates to do business with a company (or preferably multiple companies) 

4 which sell products which are of interest to the user. This SDI application is one of several 

5 conceivable applications by which the user is able through the intelligence gathering 

6 capability of SDI to identify potential opportunities which could potentially benefit the user 

7 which are also within control of the vendor wherein commercial pressure may be exerted 

8 upon the vendor via the coalitions. 

9 In this particular application, assuming that the supplier company is viable, an 

10 additional qualification as an investment opportunity to the user is how early is the 

1 1 development stage at that time. Additionally, vendors may be selected on the basis of their 

12 ability to provide products/services which match the customer's preferences and currently 

13 do business with suppliers which are prime candidates for investment (i.e., are early stage, 

14 preferably in need of capital investment and importantly are doing business with as many 

15 other vendors as possible which sell products/services of interest to the user. In our 

16 implementation, if the vendor is early stage (i.e., represents a reasonable investment 

1 7 opportunity for the (coalition members), the coalition may guarantee the vendor of certain 

1 8 customer loyalty commitments which are guaranteed (upon forfeiture if the terms of loyalty 

1 9 are not lived up to), certain shares of stock in the vendor's company. 

20 One criteria for the selection of this vendor includes the particular suppliers which 

21 he/she is currently doing business with or would be willing to switch to in exchange for the 

22 coalition's business which is typically a long-term commitment. In conjunction the 

23 selection of vendors which the coalition approaches is based upon the willingness of other 

24 competitive vendors to conform to the wishes of the coalition with regards to the vendors' 

25 selection of suppliers typically. Several multivendor (supplier) relationships currently exist. 

26 And the objective of iamworthit is to identify suppliers which are well qualified to satisfy 

27 the needs of the vendors which are candidates for their businesses and which similarly 

28 provide significant growth potential, if these vendors switch over to these suppliers. If the 

29 coalition's offer is successful, it will invest in the suppliers), prior to its public 

30 announcement of having obtained the vendor(s) as a customers). If the supplier is in 

3 1 substantial need of investment (e.g. a private placement), it is even conceivable that the 
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1 coalition may be compensated by the supplier in the form of monetary compensation or 

2 preferably free or discounted product from the vendor for which they provide their 

3 product/services. This may further assure user loyalty from the coalition and is typically 

4 limited to an initial period needed to entice the coalition members that the investment in the 

5 supplier is worth while in light of the rather high risk in such an early stage company. 

6 Other more elaborate applications are also conceivable, for example, more general 

7 requests for knowledge and influence over particular executive decisions which are not 

8 highly confidential which may provide benefits to the members of the coalition e.g. in 

9 addition other types of partnering decisions with other vendors also potentially to strategic 

10 product development directions and revisions which provide the coalition optimal personal 

1 1 satisfaction and value for their purchasing commitments. It is even conceivable that through 

12 the aid of vendor credentials (for identification purposes) user coalitions could bargain with 

1 3 vendors using "restricted currency" which stipulates its use must be for particular uses, (or 

14 its value for any other users is diminished) (such as vendors in which the coalition has an 

1 5 investment state). 

16 In a variation, these restrictions could "encourage" subsequent transacting parties of 

17 the currency (e.g. the employees of the company) to buy from particular vendors which the 

1 8 coalition support (e.g. in conjunction with or equity stake), perhaps thus encouraging the 

19 employees join the coalition as well. Data mining (performed by SDI) will also indicate 

20 existing vendor relationship opportunities for investment between/within (respectively) 

2 1 different vendors as well as the flow of currency through sequences of transactions in which 

22 these complex vendor synergies exist even indirectly via one (or multiple) removed vendor- 

23 vendor and user- vendor relationships. 

24 13.2.1.7.2 Coalitions Created For Purposes Of Eliciting Change On The 

25 Part Of Vendors 

26 In addition to applying market pressures, one other example in which entities may be 

27 automatically introduced into coalitions using SDI may involve SDI automatically 

28 identifying certain situations or complaints from users or entities in which there may be 

29 common ground for these parties to collectively apply political or legal pressure upon 

30 another entity from which the problem arises, or in one example to apply legal tactics for 
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1 purposes of reclaiming damages from the infringing party. For example, members of SDI 

2 could be asked to disclose particular instances of these types of legal infractions even if they 

3 are relatively minor (e.g. simply a short fall in acceptable quality of misrepresentations 

4 through misleading advertising etc. may be relatively minor inconveniences and/or a site 

5 may be provided as well for non-SDI members (e.g. www. cl ass . action . com . 

6 A similar application tailored specifically to the Federal Government could also be 

7 provided (e.g., www. government industries.com. In one example, SDFs ability to provide 

8 annotations excluding complaints and ratings pertaining to particular organizations and 

9 individuals is complemented by the current scheme by introducing the ability of users to 

10 apply pressure to the organization or individual as a coalition in order to more effectively 

11 motivate the desired change accordingly. To this end other tactics may be used, for 

12 example, a threat to collectively sponsor negative advertising or editorials. In fact, the 

13 above technique for presenting a believable threat could be utilized in this case and backed 

14 up by an insurance policy which provides in addition to the guaranties that the threatened 

1 5 action will be carried out, for example, by X date unless a particular condition is met, also 

16 provides sufficient funds to follow-through accordingly and in one variation further instill 

17 the motivation on behalf of the parties by actually insuring the outcome of the litigation 

1 8 (where the threatened (and guaranteed) consequence is sufficiently great that the entity is 

1 9 much rather proved to settle thus reducing exposure significantly. 

20 Within the present system SDI provides the framework by which appropriate users 

21 and sellers may be matched together. It also enables a methodology by which the user 

22 interests are protected through the use of matching of sellers offers to competitive vendors 

23 (using iamworthit). 

24 1 3.2.1 .8 Single-user Time-aggregated Purchases 

25 Another extension is to propose a special interest-bearing bank account which we 

26 set-up for the user which is tied into a wallet or debit/credit card for off-line transactions. 

27 We can allow a user agent to bundle its own purchases over time, and use the system of SDI 

28 to guarantee future payment to on-line vendors in return to a good current offer. This is 

29 novel, because it is like participating in a "discount scheme" without purchasing the right to 



Page 264 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 future discounts up front. A user can still invest money that is earmarked for future 

2 purchases with a vendor. 

3 We describe a wallet application for this purpose, that allows a user of SDI to 

4 commit currency to vendors for future purchases. The time-of-purchase competition system 

5 can allow vendors to see the pseudonymous purchasing profiles of users (e.g., over the past 

6 12 or 18 months) which are relevant to their respective commercial venues and 

7 competitively bid for the opportunity to acquire from that user a "pre-commitment" for 

8 similar spending amounts within the same time period going forward (provided that that 

9 vendor offers the same/similar categories from which the user had purchased that amount in 

1 0 the past from another vendor(s). 

1 1 The "committed currency" in the user's account is not actually spent and thus 

12 remains interest-bearing for the user, and because the dollars are exclusively and 

13 irreversibly earmarked, the actual debit could even occur substantially after the actual 

1 4 purchases ultimately occurs with that vendor. Vendors marketing the service may be able to 

1 5 exclude competitive offers (as is the case with the standard time-of-purchase competition 

16 application). 

1 7 1 3.2. 1 .9 Special Vendor Treatment in a Community Dollar Scheme 



18 There is a variation of community dollars in which some of the competitive 

19 features of the time of purchase competition system can be utilized by vendors to provide 

20 a competitive advantage over other vendors. 

2 1 Each vendor is given exclusive rights to information about offers which users who 



22 subscribed through their site or portal are currently considering. All other competitive 

23 vendors are denied access to time-of-purchase competition for that user. The vendor may 

24 request profile data from the user which s/he may wish to optimize the selection/features 

25 of the particular product offering as well as the value of the offer, and use SDI or its own 

26 price engine as before. 



27 If the user has also subscribed to full time-of-purchase competition, he/she may 

28 be willing to give the vendor the right to provide the first counter offer. If the user does 

29 not accept the counter offer, the system can default to the standard multi-vendor 

30 competitive scheme. 
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1 Vendors signed up with the community dollars scheme may also be provided with 

2 special privileges for advertising to a user as the user browses the Internet. This can be 

3 used to encourage more vendors to sign into the SDI/community-dollars scheme, to be 

4 exempted from ad replacement technology and receive other preferential treatment. 

5 The present time-of-purchase competition scheme can also be adapted for the application 

6 to television commercials. In a preferred commercial mode a relationship is created 

7 between iamworthit and one of the providers of commercial insertion technology (at the 

8 head end). All cable providers not utilizing the particular commercial insertion 

9 technology provided by that particular partner provider would be subject to the use of the 

10 commercial replacement technology which iamworthit provides by delivering typically 

11 (but not exclusively) digital set top terminals streaming (HTML based) video 

12 commercials or the same preloaded onto the set top for subsequent display. The system 

13 is able to recognize if/when commercial breaks occur during the course of standard video 

14 programming through tags which are typically digitally encoded or (in the case of analog 

1 5 TV) transmitted within the vertical blanking interval. 

16 The system could also allow a vendor to deliver (again by predefined rules) 

1 7 certain criteria for undercutting, in real time, any offer which the user browses (or even 

18 receives via e-mail). As in the case of time of purchase competition, these dynamic 

19 counteroffers can be delivered to the user via either the advert replacement system or a 

20 pop-up window/java-script window. 
21 

22 Vendors might even compete to "buy" the rights to exclusively deliver 

23 competitive offers to any offer which the user may happen to view and which is 

24 competitive to another product/service which the vendor provides. The terms of the 

25 rights which the vendor actually purchases may include the number of competitive offers 

26 which the vendor can provide in the future, the degree of the conditional "value" which 

27 the vendor would provide through the competing offer, the category(s)/domain(s) which 

28 the vendor purchases the right to compete, the degree of exclusivity/non-exclusivity. A 

29 vendor may also wish to constrain the ability of potential competitors to submit counter 

30 bids. 
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1 Vendors in all other product categories can retain the ability to compete for any offer 

2 which the user receives. It is possible that vendors with exclusive rights may also try to lock 

3 a user into future purchases. Again, this ability to lock users into future purchase 

4 commitments may be auctioned to competitive vendors. The dedicated currency auction 

5 web site (e.g., www. creditauction . com ) is architecturally very similar to the time-of- 

6 purchase competition variation of the service with the exception that it is a market place in 

7 which many users may submit their profiles with request for offers from vendors across any 

8 or all categories relevant to their profile as requested or agreed to by the user. A variation of 

9 this dedicated currency involves a scheme for time shifting into the future the transfer of 

10 funds from the users account to the vendor's (thus allowing the users money to accrue 

1 1 interest during that period). Thus dedicated currency whether it proceeds or preceeds actual 

12 purchase events guarantees the vendor sales which can be used plan capital 

1 3 improvements, attract credit or investment funding while providing benefits such as 

1 4 and/or interest on money spent to consumers. 

1 5 This reverse auctions for time-shifted purchases may also encourage vendors to form 

1 6 virtual retailer communities, to accept a common currency and offer a full range of products 

1 7 and services to a user. Any offers involving the user dedicating any form of 'Value" for the 

1 8 user for future purchases of products/services is applicable towards purchases at any of these 

19 vendors in the community. SDI can be used to identify optimal combinations of vendors, to 

20 best satisfy the customer's needs. Unlike prior art multi-vendor loyalty point systems, in 

21 which the selection of vendors is static across all users, this allows the dynamic creation of 

22 loyalty schemes. This is possible because of the consumer preferences which can be made 

23 available via SDL In cases where vendors offer some common products it may be possible 

24 to redeem value at either vendor's site, or to restrict a purchase to products at only one of 

25 vendors. From a data analysis perspective a primary criteria for matching vendors is to 

26 identify those which have products predicted to be interesting to the user are metrically close 

27 to each other within user (cluster) space (are predicted to be preferred by many of the same 

28 customers) AND have a minimal number of products which overlap, thus minimizing 

29 collaboration among competitors. 

30 The actual debiting of a user's account can occur substantially after a commitment 

3 1 to spend money with a vendor, allowing a user to continue to accrue interest. In this 
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1 multi-vendor mini market the money can be earmarked for that vendor exclusively at the 

2 moment of purchase. 

3 13.2.1.10 LEIA Enhanced Time-of-Purchase 

4 One application of LEIA involves the integration of the time of purchase 

5 competition architecture. In particular, purchasables which a user is browsing with an 

6 associated offer price may be either purchasable on-line or off-line. Based upon his/her 

7 present physical location, if the user so desires s/he may identify not only other similar items 

8 but also that subset of these items sold by bricks and mortar vendors which are physically 

9 proximal to the user's current location (thus available for immediate purchase). By 

1 0 introducing location enhanced information to users about their relative similar proximities to 

1 1 off-line purchasables the time of purchase vendor competition model is further enhanced by 

12 placing additional competitive pressure upon those vendors selling similar goods 

1 3 exclusively on-line (which are not immediately available to the user). In this environment if 

14 an off-line product offer is ultimately accepted by the user, a digital certificate could be 

15 securely conveyed to the user or authentication of his/her identity could be established at 

1 6 time the offer was accepted and upon redemption. 

1 7 The Location Enhanced Information Architecture (LEIA) enables vendors in the 

1 8 vicinity of where a user is traveling to deliver a targeted ad or particular personalized 

19 offer. Or a user may search or navigate directories on the Web or browse a particular 

20 (competitive) item. Regardless of the mode of access, the system filters or identifies 

21 purchasables by their direct physical proximity to the user (or the user's intended travel 

22 path). If the time of purchase competition system uses location proximity of an item as a 

23 parameter for vendors to compete with one another (i. e. providing an additional 

24 speed/convenience factor for on-line items with an off-line sales presence) the particular 

25 vendor's price which is finally agreed upon and accepted by the user can be securely 

26 enforced at the point of sale using a digital coupon which is non-transferable and non- 
27 reusable may be down-loaded to the user's device, printed or simply securely transmitted 

28 to him/her as a unique code redeemable with user identification at the physical point of 

29 sale. These techniques are fully disclosed in co-pending patent application entitled 

30 "System for the automatic Determination of Customized Prices and Promotions". 
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1 13.2.1.11 Extended Example: Medical Insurance Application 

2 Problem statement: 

3 Increasingly, insurance companies are denying claims for medical services. 

4 Hospitalized patients frequently have in-patient days denied, in part or totally, and 

5 associated services and studies may also be denied. Days of partial or total denial are 

6 especially important for non-DRG (Diagnosis Related Group, Non-Medicare) insurance 

7 subscribers. Also in-patient and out-patient testing and services are frequently denied at the 

8 discretion of insurance administrators. These denials persist despite consistent efforts by 

9 hospitals and hospital personnel to properly document the appropriate protocols to alleviate 

10 patients' illnesses. Similarly, out-patient medical care in doctors' offices is not infrequently 

11 denied, and to the extent of prohibiting effective diagnosis and treatment. At times, 

12 hospitals as well as out-patient office personnel do not have the necessary information to 

1 3 know, prior to treatment, which services are eligible for reimbursement, partially because of 

14 the multitude of coverage patterns. The providers must frequently appeal non-payments 

1 5 retrospectively, by letters, further documentation and repeated telephone calls, all of which 

16 may or may not be effective, apparently depending on the capriciousness and the judgment 

17 of the particular clerk or medical personnel in the carrier's office. These situations have 

1 8 produced enormous pressures to improve communications by extracting detailed data from 

19 providers pertaining to carriers failing to support appropriate and necessary health care 

20 delivery and documentation of how these insurance issues have caused definitive health 

21 problems. This data will be used to exert pressure on insurance carriers. In view of these 

22 major economic pressures, hospitals and other providers of medical services will definitely 

23 be most cooperative in providing detailed billing and clinical information with appropriate 

24 release consents, or alternatively through deployment and utilization of the proprietary 

25 techniques for privatizing patients' records using a pseudonymous proxy server. This 

26 information will be provided by the billing and medical records departments of the hospital 

27 and also, in the case of hospital employees, from the Human Resources Department of the 

28 hospital for reimbursement for medical services for hospital personnel. The minimal 

29 information should include the billing record for those in-hospital days or the specific 

30 services/tests denied and the discharge summary of that hospitalization. Based on reviews 

31 of this material, further portions of the in-hospital clinical record may be requested. 
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1 (Additionally, the iamworthit service provides for the utilization of the above mentioned 

2 proxy server for collecting and maintaining pseudonymous patient data whereby advertising 

3 revenues from industry advertisers accessing/targeting desired user profiles may be shared 

4 with the hospital and potentially also the patient, thus further insisting the hospital to provide 

5 this needed patient data). 

6 A compounding problem is the lack of information provided prospectively by 

7 insurance companies to hospitals, physicians and other providers. Subscribers and 

8 providers have the right to know, in detail, the number and kinds of services covered, and 

9 the depth of the insurance coverage including the length of stay. This information is 

10 legally accessible and can be made available not only to the patient, but also to his/her 

1 1 physician. On exemplary commercial need for the present service by employers who 

1 2 wish to reinsure their employees with quality coverage. 
13 

14 Solution: 

15 In order to protect the consumer, a solution proposed to address a timely and 

1 6 overwhelming need on the part of consumers to establish a control data bank, accessible 

17 to the consumer, which reveals general and specific instances of miscarriages by health 

18 insurance carriers of their fiduciary obligations to provide timely payment for essential 

19 health care services. Based upon data regarding the insurance company's stated policy 

20 and statistical analysis of the ultimately approved or disapproved coverage, performance 

21 criteria may be determined pertaining to the percentage likelihood of the insurance 

22 company covering certain services and length of hospital stays for future services to 

23 patients. In order to provide a robust and comprehensive statistical analysis, it is also 

24 required that a certain core sample of non-coverage denied service be collected in order 

25 to determine a variety of correlation criteria which affect the probability of coverage in 

26 addition to the identity of the carrier, also the type of prescribed medical care for 

27 associated illness wherein coverage for hospital stay or treatment is denied. Other 

28 correlations are possible, e.g., the medical history of the patient, the family history, etc. 

29 Of additional importance, statistics may also be provided which indicate the degree of 

30 adherence (integrity) of an insurer to provide coverage and to provide it with the level of 

31 depth of coverage which was purported by the insurer according to its standard claims 
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1 policy. The iamworthit insurance service is accordingly designed with two primary case 

2 objectives: 

3 1) Provide detailed statistics based on robust historical data regarding realistic 

4 expectations which the patient can anticipate regarding acceptance, denial and quality 

5 on the basis of various types of coverage given the context of the insurance needs and 

6 requirements of the buyer and based upon the specific services and depth of coverage 

7 purported by the insurer and any associated medical information regarding the buyer. 

8 The service could even anticipate, by review of the collective medical statistics the 

9 most likely type of coverage the buyer may need, based on his/her medical record 

1 0 specifically within the context of the associated determined probability of coverage 

1 1 and quality thereof for each insurer in view of this medical history of the buyer. In 

12 addition, iamworthit may provide a general informational resource where buyers can 

1 3 access such statistical information by carrier. Data mining tools may enable the buyer 

14 to extrapolate the desired statistical correlations as to his/her type of profile of 

15 medical or specific features of his/her medical condition or specific type of coverage 

16 and thus in light of this robust data better equipping the buyer to determine which 

17 insurance provider and associated policy is most appropriate for his/her specific 

1 8 health care coverage needs; 

19 2) This information may be accessed in real time by the buyer (or for e.g. buyers, 

20 employer) while browsing specific insurers' sites, receiving specific offers by 

21 insurers (typically requiring certain medical information) or browsing an insurance 

22 retail portal. Typically, buyers of the present service will combine their use of the 

23 present browsing-based information with the Time-of-Purchase Vendor Competition 

24 Services (described below). Additionally, a rating feature may also be provided 

25 which enables the buyer to rate (and subsequent buyers to observe the averaged 

26 ratings) the various vendors by various types of criteria relevant to the buyer's 

27 experience in receiving acceptance, quality and depth of coverage (as well as how 

28 well the insurer adhered to promises for coverage as stated in its standard policy for 

29 coverage), of course, this feature could be used if for insurance products on-line for 

30 later purchase off-line. 
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1 Iamworthit's time-of-purchase vendor competition application to medical insurance 

2 (as applies similarly to numerous other e-commerce commercial domains) involves the 

3 following protocol: 

4 The iamworthit enabled consumer provides specific needs or requirements as to 

5 what types of medical insurance products or areas of coverage are desired. These requests 

6 may be submitted either to a portal (or "mall") which connects the buyer to an extensive 

7 resource of medical insurance providers, or, alternatively, these requests may be submitted 

8 to iamworthit which provides its own default portal to these resources (which likely exists 

9 and is marketed as an independent marketplace for insurance purchasers on its own behalf). 

1 0 Upon submitting these buyer requests, iamworthit is able to provide numerous competitive 

1 1 insurance providers offering similar insurance products which in this application may be 

12 accompanied by certain credentials (such as the buyer's age and many other clinical 

1 3 parameters). 

14 Additionally, the pseudonymized digital medical records of the buyer (which may 

1 5 also be requested by an insurer, typically in addition or instead of submitting a questionnaire 

16 regarding the buyer's health such as certain pre-existing medical conditions). Typically, 

1 7 patient medical records are a prerequisite to submitting an offer. The insurers may then 

1 8 provide specific offers for their respective products. Additionally, the buyer may browse 

19 various insurer's sites, providing pseudonymous medical data to the extent (and type) that it 

20 is requested and, specifically, to which providers the buyer desires. As offers from 

21 insurance providers are collected (in response to requests and/or browsing interactions), the 

22 time of purchase vendor competition is elicited whereby each insurance provider is 

23 automatically notified as to the most valuable offer(s) provided by a competitors) and are 

24 given the opportunity to submit counter offers accordingly. It should be noted that the 

25 determination of the value may be in accordance with certain buyer prescribed 

26 specifications such as price or certain expert determined criteria which consider either the 

27 price in combination with completeness and depth of coverage and, if relevant, the level of 
2 8 satisfaction of the buyer' s request criteria. 

29 Additionally, iamworthit provides the ability for buyers to share the information of 

30 offers they have received in accordance with the associated medical profile and/or response 

31 to health related questionnaire information, in order to determine if an offer which is 
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1 submitted by an insurer inappropriately discriminates against the current buyer in terms of 

2 price or depth/comprehensiveness of coverage, or where such discrimination occurs in 

3 conjunction with buyer medical data which is prohibited by insurance regulatory guidelines 

4 as a valid criteria for such discrimination. Iamworthit's insurer coverage statistics (which 

5 the buyer may typically wish to observe in conjunction with the present time of purchase 

6 vendor competition functionality) are an additional important value determination factor 

7 (which may be factored in as an additional weighting criteria as numerically prescribed by 

8 the service (or by the buyer) and/or accordingly for purposes of setting a threshold by which 

9 insurers become disqualified as candidates for the buyer's business. In a variation, as is 

10 provided by the current model, buyers may instead first receive the historical offer 

1 1 information from other "similar" medically profiled buyers in order to submit a bid to that 

12 carrier which offered the most relevant and beneficial offer as well as all of the other 

13 competitive insurers. As is also provided according to the current scheme, the present 

14 iamworthit time of purchase vendor competition facilities may operate fully autonomously 

1 5 on behalf of the buyer (incorporating, if desired, specific request criteria). 

1 6 Thus, optimal offers are provided to the buyer by a "buyer agent". Additionally, the 

17 present service may with some modification be provided to buyers dialing into an 800 

1 8 number. Typically, in this case, however, (as is also the case in the on-line implementation), 

19 if the buyer does not have access to his/her pseudonymous medical records, he/she would 

20 typically answer a questionnaire for each provider (which would typically consist of an 

21 aggregate questionnaire satisfying the requests of all relevant carriers which could then be 

22 submitted automatically, or an agent could be programmed to automatically complete the 

23 questionnaire as needed. 
24 

25 Commercial Alliances with Regional or National Health Care Organizations - 

26 Because of the tremendous restrictions in providing insurance coverage to 

27 organizations which provide health care services, it will be greatly to their advantage for 

28 patients and physicians to be able to benefit from more fair and competitive insurance 

29 coverage by using time of purchase competition and the associated coverage probability 

30 service (providing full disclosure of non-coverage and inequities in delivery of coverage). 

3 1 The primary joint marketing entities which will benefit most significantly from this service 
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1 and thus will be the strongest commercial allies, include independent physicians, medical 

2 clinics, large physician groups, pharmacies and perhaps even pharmaceutical companies, 

3 providers of medical supplies, hospitals and home health providers. Of the above entities 

4 listed, physician groups, clinics, and hospitals do a tremendous amount of advertising 

5 directly to consumers (and this is becoming increasingly true via the internet for physicians 

6 in private practice), particularly through the direct advertising medium of the internet (e.g., 

7 advertising on their web sites as well as email). These organizations, in light of their direct 

8 marketing exposure to the consumer which they provide, would likely become powerful 

9 allies in assisting the promotion and exposure of the competitive insurance with full 

1 0 disclosure of claim information service to their consumers. 

1 1 Secondly, health care organizations which market to health care professionals, such 

12 as physicians and nurses, including pharmaceutical companies, pharmacies, and home 

1 3 health care providers may indirectly benefit by informing their target customers about the 

1 4 above consumer oriented service. 
15 

1 6 Quality Ratings of Medical Suppliers 

17 iamworthit's insurance application provides a natural entry into a second 

1 8 commercial application of the service. In a fashion similar to that of Time-of-Purchase 

19 Vendor Competition Scheme, for purchasers of insurance products, another useful 

20 commercial application of the present service is facilitating the time of purchase vendor 

21 competition between medical suppliers and equipment providers. The synergy by which 

22 this second commercial application is enhanced is as follows: physicians and hospitals will 

23 likely be cooperative allies to the service in providing patient information substantially as 

24 required as well as promoting (along with home health care providers) particularly the 

25 iamworthit insurer coverage probability service to their consumers. They will also likely 

26 utilize such a resource for internal uses and if7when HMO service liability legislation 

27 becomes enacted, for legal purposes as well. 

28 To the buyer's advantage, it is possible (as in the insurance application, albeit less 

29 likely) that unfair or arbitrary price discrimination may be detected and counteracted by 

30 iamworthit's data sharing feature. Common buyers in this environment include hospitals, 

3 1 physician clinics, home health care providers and to a lesser extent HMOs and end-buyers, 



Page 274 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 the nature/types of supplies and equipment which tend to be purchased by each of the above 

2 categories of buyers are provided in further detail below. As suggested earlier in this 

3 description (and in issued patent "Pseudonymous Server for System for Customized 

4 Electronic Identification for Desirable Objects"), a vertical portal as the one described, may 

5 be personalized based upon the user profile as disclosed upon accessing that portal in 

6 general, at the item level as within a category (or in conjunction with search results). 

7 Additionally, vendors may be equipped with price discrimination tools and are in fact highly 

8 motivated to form a coalition by entrusting their data and counter offer responses to 

9 incoming time of purchase vendor competition offers and bids to SDI (or at least an industry 

10 specific counterpart of SDI) for medical suppliers in order to provide limits or "price 

1 1 ceilings" for any given buyer and associated profile and condition relating to competitive 

12 offers or bids. 

13 As in the case of almost any category of purchaser, particularly the larger entities, 

14 such as, unions, hospital systems and large physician organizations, additional bargaining 

15 leverage can be applied to suppliers because of the volume of supplies which can be 

16 purchased. The iamworthit model is particularly also likely to facilitate the buyer's leverage 

1 7 when negotiating long-term contracts with suppliers through the buyer-buyer data sharing 

1 8 and real time competition/bidding facilities which iamworthit provides. 

1 9 Finally, it is worthy to note that as in the insurance application, a rating feature may 

20 also be provided which enables the buyer to rate (and subsequent buyers to observe the 

2 1 averaged ratings) the various vendors by various types of criteria such as quality of products, 

22 promptness of delivery, quality of service (or as in medical equipment, quality and 

23 promptness of technical assistance/repairs), availability of desired inventory etc. 

24 There are a variety of types of health care organizations which could directly benefit from 

25 the time of purchase competition and quality rating service of medical suppliers which are 

26 described below: 

27 (a). Physicians Clinics and Large Physicians Groups - Typical types of medical supplies 

28 and equipment which these organizations will purchase using the iamworthit medical 

29 supplies application include: equipment such as X-ray equipment, ultrasound units, EKG 

30 units, EEG units, computer and other office hardware, chemical analyzer units, clinical 

3 1 supplies, office furnishings, pharmaceuticals, etc. 
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1 (b). Hospitals and Hospital Systems - Typical medical supplies and equipment which 

2 hospitals will accordingly purchase include: MRI units, Cat Scan Units, X-ray units 

3 (including units for special studies and mammograms), radiation therapy units, stress-testing 

4 equipment, hematological and chemical testing equipment, surgical supplies, prosthetic 

5 supplies, patient-care supplies for medical and surgical floors, IV solutions and 

6 pharmaceuticals, physical plant operational and replacement equipment and supplies, etc. 

7 c). HMOs - Currently, a certain percentage of patient health plans include coverage for 

8 pharmaceutical products. These products are typically provided through contract with 

9 centralized pharmaceutical suppliers. It is accordingly reasonable to suspect that many of 

10 these HMOs will invite the opportunity to provide these supplies through a more open 

1 1 dynamic marketplace environment as is provided by the iamworthit service. 

12 (d). Union Directed Health Care Organizations - Union clinics typically purchase 

1 3 equipment and supplies, similar to private clinics or large physicians offices, as described 

14 above. Union private health care organizations would be compelled to incorporate the 

15 iamworthit medical supplies application as it similarly could apply to all of the above 

16 entities. 

17 (e). Home Health Care Market - Equipment and supplies which are, in part, similar to large 

1 8 physicians clinics as well as some hospital - type supplies, used to provide out-patient care 

1 9 and maintenance services. 

20 Patient Medical Records 

21 The secure data interchange provides a secure and privacy protected storage and 

22 retrieval architecture in which buyers (in this case patients) are able to control the use of 

23 their personal profiles (in this case medical record information). Herein, we propose an 

24 alternative model which is based upon the assumption and belief that purveyors of this 

25 medical information, including hospitals, physician's offices/clinics, labor unions, (to some 

26 extent) HMOs are legally permitted to disclose and use their information for advertising 

27 purposes so long as a method is provided which insures that their information will not be 

28 associated with any identifying information about the patient (name, address, social security 

29 number, etc.). If regulatory constraints and/or the purveyor of this information prohibits the 

30 use of this information for the above purpose, patient permission could be acquired by 
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1 patient signature on all appropriate consent forms at the time of admission and directly on 

2 the admission form. 

3 In one variation, software may be provided to the organization which enables the 

4 organization to operate their own pseudonymous proxy server, thus enabling access by 

5 advertisers to the pseudonymized patient records. Email and/or telephony-based 

6 pseudonymous communications could be readily provided. For direct mail, the address 

7 information maintained by the hospital would mandate the hospital to provide direct mail 

8 advertising to the patients or to entrust this operation to a pseudonymous physical mail 

9 service such as SDI. In the (much) preferred variation, a trusted pseudonymous proxy 

10 server operator maintains the patient information on an external database (this would 

1 1 perhaps be SDI or an SDI-like service) in which all of the above pseudonymous 

12 communication media could be provided along with pseudonymous direct mail services. 

13 For the SDI data entrusted variation, the preferred business model to provide incentives 

14 to these organizations to cooperatively provide this data would involve SDI's privacy 

15 policy enforcement capabilities in combination with a revenue sharing model in which 

16 revenues from advertisers such as pharmaceutical companies, pharmacies, clinics, 

17 HMOs, hospitals (where the purveyor of the data is not a provider of the same 

18 competitive services). In the case in which HMOs are the advertisers, the HMO would 

19 identify patients who are currently subscribed to competing HMOs and (typically) have a 

20 medical history indicative of a good risk to the HMO. The iamworthit insurance 

2 1 competition scheme, as above described, could be implemented within the context of this 

22 current solicitation scheme in order to elicit time of purchase competition among HMOs 

23 upon the submission of an offer by the original HMO to the pseudonymous patient. 

24 Advertising purposes so long as a method is provided which insures that their 

25 information will not be associated with any identifying information about the patient (name, 

26 address, social security number, etc.). If regulatory constraints and/or the purveyor of this 

27 information prohibits the use of this information for the above purpose, patient permission 

28 could be acquired by patient signature on all appropriate consent forms at the time of 

29 admission and directly on the admission form. 

30 In one variation, software may be provided to the organization which enables the 

31 organization to operate their own pseudonymous proxy server, thus enabling access by 
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1 advertisers to the pseudonymized patient records. Email and/or telephony-based 

2 pseudonymous communications could be readily provided. For direct mail, the address 

3 information maintained by the hospital would mandate the hospital to provide direct mail 

4 advertising to the patients or to entrust this operation to a pseudonymous physical mail 

5 service such as SDL In the (much) preferred variation, a trusted pseudonymous proxy 

6 server operator maintains the patient information on an external database (this would 

7 perhaps be SDI or an SDI-like service) in which all of the above pseudonymous 

8 communication media could be provided along with pseudonymous direct mail services. 

9 For the SDI data entrusted variation, the preferred business model to provide incentives to 

1 0 these organizations to cooperatively provide this data would involve SDFs privacy policy 

1 1 enforcement capabilities in combination with a revenue sharing model in which revenues 

12 from advertisers such as pharmaceutical companies, pharmacies, clinics, HMOs, hospitals 

13 (where the purveyor of the data is not a provider of the same competitive services). In the 

14 case in which HMOs are the advertisers, the HMO would identify patients who are currently 

1 5 subscribed to competing HMOs and (typically) have a medical history indicative of a good 

16 risk to the HMO. 

17 The iamworthit insurance competition scheme, as above described, could be 

1 8 implemented within the context of this current solicitation scheme in order to elicit time of 

19 purchase competition among HMOs upon the submission of an offer by the original HMO 

20 to the pseudonymous patient. 

21 13.2.2 Matchmaking 

22 The user-centric SDI model allows users to provide personal information on a 

23 carefully controlled basis to vendors and other users. Furthermore, vendors can implement 

24 rules that personalize the information, products, and service provided to users — on the basis 

25 of personal information that they receive from users directly, or have acquired about users. 

26 The key property of "matchmaking" applications is that information exchange is bi- 

27 directional, i.e. a match is not computed on the basis of static information within the system, 

28 but may involve additional information exchange between agents, and/or special actions by 

29 one or more agents. 
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1 Client-level SDI proxies can act as autonomous agents in an architectural variation 

2 of SDI, where the "client-level proxy" is co-located with a (physically) mobile user, for 

3 example on a palm-held computer or head-up display. In a "match-making" application the 

4 goal of these SDI proxy agents is to find other agents with desired profiles/synergies, with 

5 the view to taking an action or exchanging a particular type of information. We describe 

6 many applications, for example: to matching buyers with a virtual sales force, to a market 

7 for expert opinion, to groupware systems for virtual team working, and to an investment 

8 system. 

9 We can allow agents to communicate anonymously, without revealing (even 

10 pseudonymous) identities. In addition, we can prevent one agent receiving profile 

1 1 information about another agent unless authorization is provided within the rules of data 

12 release associated with the agent's profile. Implicit authorization occurs when a requestor 

13 can present certificates to verify that it has required attributes to access particular 

14 information. Explicit authorization occurs when a requestee provides direct authorization 

15 to a particular part of a user's pseudonym. 
16 

1 7 Essentially there is bi-directional information filtering: the requestor agent will only present 

18 certain information to the user, information that is relevant; and the requestee will only 

19 provide information when a request is judged to be legitimate. Information exchange 

20 between agents occurs as part of a multi-step negotiation, until both parties can agree on 

21 terms for either a physical meeting (or execution of a deal), or further pseudonymous 

22 exchange of information or cooperation. 

23 In match-making applications the role of the central SDI query execution engine is 

24 to search continuously for "matches" between agents, based on agent profiles. When a 

25 match occurs additional information exchange may occur automatically between agents, 

26 until finally an introduction is made or an action executed by one (or both) of the agents. 

27 Decisions about what information to exchange are made on the basis of both static and 

28 dynamic profile attributes, e.g. standard (historic) profile information, current behavior, 

29 current location, and recent activity. Also mode of behavior e.g., social, business, leisure as 

30 detected by the user's calendar,, time, content in recent conversations, type of users recently 

3 1 interacted with, etc. LEIA style-behavior attributes can be used to automatically decide on 
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1 the relevance of new virtual tag information. A requestee might also demand certain 

2 credentials to indicate the lack of negative reputation marks, for example that an interaction 

3 with the user has never received a bad rating (see Section 2.2.3). Perhaps a third-party could 

4 be used to determine whether the user's know each other (e.g. www.sixdegrees.com ). When 

5 a requestee denies a request for information, it may instead provide criteria for data releases. 

6 A requestor can respond with a different information request, or a subset of required 

7 credentials. Finally, the agents might agree on terms of negotiation and conditions can be 

8 anonymously fixed. 

9 There are (at least 5 levels) of information disclosure: indicate to another user 

10 interest; release profile information; disclose communication; start a correspondence 

1 1 session; schedule a meeting/strike a deal, take some other action, etc. The end-result of 

1 2 information exchange could be an agreement to calendar a meeting for some future time and 

1 3 place; and absolute, or pseudonymous revelation of identity. 

14 An initial implementation of the data-release policies might allow only manual 

1 5 definitions. However, after an initial "beta testing" phase, a data mining suite could be used 

1 6 to cluster users and generate exemplar data release and data request policies. A system can 

17 provide default settings for users, and recommend setting based on users with similar 

18 profiles. The user can further fine-tune the rules. Automatic feedback techniques can be 

1 9 useful to adjust rules, for example — when a user is especially receptive to particular type of 

20 introduction then make such introductions more likely in the future. An intelligent interface 

21 system might also suggest refinements to the rules, to automatically cover "patches" where 

22 the user currently controls interactions. For example, some rules may be based upon a 

23 certain confidence threshold, which the system suggests or refines. The user may accept or 

24 adjust the threshold as desired. S/he may wish to provide settings as to which types of 

25 autonomous actions the user wishes to authorize updates to (based upon his/her own 

26 behavior or as a default users who are very similar) and the user wishes to authorize the 

27 recommended action, or allow the system to fully automate the activation of the 

28 recommended actions (or do so only if the confidence threshold is above a certain level). Or 

29 for example bypass user authorization of a recommended action if it falls within a certain 

30 margin of confidence below the normal confidence threshold for activating full automation 
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1 if the user is performing another important task , e.g., in a meeting, engaged in an important 

2 phone conversation 9 

3 13.2.2.1 Resolution Credentials 

4 Resolution credentials from third parties can play an important role in agent-agent 

5 interactions. We can prove the absence of a quality attribute or behavior (which is often of a 

6 negative nature) relating to an individual and is submitted by a third party and typically must 

7 be issued on a periodic basis in order to maintain currency. A few simple examples of 

8 resolution credentials which may be of interest to users (credentials which users may 

9 commonly request as a precondition to requesting or accepting requests to be introduced or 

1 0 initiate communication with an outside unknown third party) include: 

11 1) For business associations: are in good business standing, e.g., have not attempted 

1 2 to defraud other users in the course of common business practices. Or maintain 

1 3 sufficient funds in one's account to perform business activities (as represented by 

14 the user). 

1 5 2) For business interactions or social interactions: are in good standing with the law. 

16 3) For social interactions: have not been accused by other individuals of 

1 7 inappropriate or antisocial behavior. 
18 

19 Some standard credentials which may be of interest to many users, and which may 

20 (as with resolution credentials) be incorporated with the standard settings of the user's data 

21 request policy as herein described. A few examples are cited (among countless potential 

22 others): profession, awards, honors, alma mater, e. g., Harvard graduate, doctorate degree, 

23 etc. In accordance with the parent issued patent US Patent #5754938, various credential 

24 issuers are provided for issuing standard and resolution credentials to individuals. Thus 

25 certain entities may be entrusted with "legitimate authority" to validate and submit 

26 credentials which are issued to the appropriate individuals. If a resolution credential is not 

27 issued (or not renewed) an adjudicating third party is provided which has access rights to 

28 both of the parties is provided to resolve resulting disputes (from the subject user). The 

29 present invention describes how credentials can be issued to users pseudonymously. 



Page 281 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 There are a variety of rules which a user's data disclosure policy and data request 

2 policy may contain, to control what if any attributes are released, and what credentials are 

3 required. A data request policy may state a rule for explicitly notifying the user if a 

4 particular resolution credential (e.g., indicative of a serious problem or concern) cannot 

5 be presented in response to the user's disclosure request. We allow initial information 

6 exchange to be anonymous, such that information that is released as preconditions for 

7 release of further information is not useful. Similarly, so long as initial encounters are 

8 anonymous there is no need to withhold information about them from the user. 

9 Some users may not wish to disclose specific information about themselves via these 

10 standard credentials but instead certain "extracted" more general information may be 

1 1 provided about themselves. For example, instead of a "Harvard grad or Ph.D." there may be, 

12 for example, credentials indicating "intellectual" or "prominent intellectual". Or instead of 

1 3 indicating an individual's wealth or value of assets, the credential may indicate "wealthy" or 

14 "very wealthy" (typically, depending upon user's wishes this latter credential should also be 

15 withheld during initial introductions or subject to some fairly stringent conditional criteria 

16 from the other party) and instead replaced with an even more general credential e.g., 

1 7 "prominent" or "influential citizen"). Similarly, an individual's exact profession or scope of 

1 8 work may not be fully disclosed initially but rather a more general definition of his/her 

1 9 profession or perhaps the general field initially in which the user works or his/her particular 

20 views on certain social or political issues may instead be substituted with (for example) 

2 1 "socially conscious", "philanthropic", or "moderately liberal". 

22 Another example of a credential of potential interest may include the profiles of 

23 users which a certain individual associates with or is acquainted with. The ability of a third 

24 party to gain access to this information, however, is conditional upon the data release policy 

25 of that associate's or acquaintance's data (e.g. it could be affected by what" is the profile of 

26 the common acquaintance to whom that user would be disclosed as an associate as well as, 

27 importantly, the profile of the prospective disclosee.) In one variation, the system may 

28 simply identify the fact that there are common associates and acquaintances between the two 

29 individuals. Again that associate's or acquaintance's data release policy may further control 

30 even detection of this fact. It may instead also notify one of the parties of this fact, but 

3 1 request that it not be disclosed to the other party. 



Page 282 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 In accordance with the parent patent application, rules may be learned regarding 

2 certain things that a user does (as in ascribing these rules for which messages to send to 

3 whom or what user profiles and under what circumstances/events surrounding the target 

4 user). Thus, his/her agent may begin to suggest certain future actions which could be 

5 performed in the future upon user approval or even automatically. If the user has had no 

6 previous interaction at all with the system, it may identify which other users of the system 

7 the present user is most similar, and recommend initial rules. Additional textual attributes 

8 can also be leveraged to provide extra criteria, and data mining techniques used to generate 

9 more appropriate rules. 

1 0 Another category of user credentials include features that may be inferred implicitly 

1 1 by location/time data captured by LEIA. Such information may reveal a user's likely 

1 2 behavior and activities. These inferences, however, are unavoidably somewhat speculative 

1 3 and inconclusive, thus cannot be substantiated on a valid basis for issuing credentials. The 

1 4 data may be useful in suggesting the present context and circumstances surrounding a user. 

1 5 Additionally, the communications which the user may be presently involved in i.e., 

1 6 the content profile of his/her spoken dialogue and/or other "on line communications" may 

1 7 be used and combined with location/time patterns in order to further infer the circumstances, 

18 behavior, and present temporal interest of a user and/or third party for purposes of 

1 9 employing the user's data disclosure and data request policies. 

20 Credentials can allow users to identify other users that may pose a threat. This 

21 identification may be provided vis-a-vie resolution credentials and/or rating (by third 

22 parties), e.g. a user has not engaged in any serious criminal activity, physically harmed 

23 another person, or interacted with other individuals who are unable to produce these 

24 resolution credentials. Other credentials may specify the nature of an infringement, and its 

25 context and severity (e.g. what was the context of a physical assault? Was it performed 

26 during a bar brawl, against a friend, a boss, an elderly person, a child, a family member - or 

27 at work? In this case, the user agent may, for example, bring to the attention of a 

28 prospective employer that the user could not present a credential indicating that they had not 

29 previously harmed or threatened a former employer. Was it minor or severe? Also, if such 

30 individuals (lacking, for example, resolution credential proving the absence of having 

3 1 committed armed robbery) are (or come) within a certain proximity of a user, the user may 
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1 wish to program his/her user agent to notify the user. The same would, of course, apply to a 

2 store clerk regarding customers of this sort or to baggage security personnel at an airport. 

3 Or, highway patrollers may be interested (e.g., on certain stretches of highway) in being 

4 made aware of vehicles and their locations whose agents are unable to provide a resolution 

5 credential proving the absence of a drug conviction. 

6 In another application (in accordance with the auto insurance risk determination 

7 methods described in co-pending patent application entitled "Applications for Location 

8 Enhanced Information Architecture" [U.S. Patent Application # 09/314,321]), an on-board 

9 computing device within a user's automobile could identify another automobile lacking, for 

10 example, a resolution credential for safe driving, i.e. the on-board user agent continuously 

1 1 polls agents in other cars for a "safe driving" credential, and if it fails to receive such a 

12 credential it issues a warning to the user. As an extension, this location data could be 

13 converted into a dynamic 2-D rendering upon the user's windshield (using heads up display 

14 technology) in order to thus superimpose a persistent flagging or highlighting of that 

15 particular automobile from the driver's visual perspective. Pedestrians or low enforcement 

1 6 officials (for example) could also receive instant notification. As is described in LEIA, a 

1 7 roaming cellular connection, or GPS, is not essential for providing a user identifier. For 

18 example, optically-based biometric identification techniques such as iris scanning or 

19 combined iris/facial identification techniques may be used among other potential inputs as 

20 well. Users will be reluctant to release location/time data, even anonymously, when 

2 1 suspicious behavior can be inferred — probably subjectively. 

22 In the case of suspicious behavior we can provide access to law enforcement 

23 officials, through seizure of the decryption key for that data (which includes his/her physical 

24 location information) and any additional profile data which is considered of immediate 

25 critical relevance to the suspect (or prospective) infraction. Such cryptographic techniques 

26 for key seizure from a key escrow are well covered in the literature [FRED, WHAT ARE 

27 THE REFERENCES?]. There may also be certain circumstances in which key seizure may 

28 be required after the fact (at some time in the future). For example, itfwhen certain even 

29 moderately "suspicious" behavior patterns are detected, it may be possible for the SDI data 

30 warehouse to preserve a comprehensive record of that information (and perhaps the record 

3 1 of that user which precedes and follows that period of interest). Thus preserving evidence 
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1 which may later prove useful in contributing evidence towards a conviction, acquittal, e.g., 

2 proving that a user was not at a particular location/time. A record containing more detailed 

3 segments of a user with a proven negative or questionable history may be preserved and 

4 general location/time features may be abstracted for the remaining portions of the record 

5 (thus compressing the record substantially). This may be performed for regular individuals 

6 as well, thus retaining key relevant features while discarding the majority of the record 

7 which is irrelevant or redundant. 

8 Within a location enhanced context, unless the prescribed range of "proximity" to 

9 the user is quite large, securely protecting the user's identity from malicious third party 

1 0 collusion (for purposes of combining unique pseudonyms and/or exchanging data that has 

11 been released and entrusted to them) is a harder problem. The system could (most 

12 obviously) assume data exchange between the parties will occur and limit the combined 

1 3 disclosure to only that of the most data restricted user in a given location/time domain. The 

14 system could alternatively, perhaps "space apart" the number of users within a given 

15 location/time context who can access more "restricted" user data (of course the problem 

16 goes away if all the discloses have similar disclosure restrictions by that user). The use of 

1 7 resolution credentials within the context of a location enhanced (physical) environment will 

18 be hard to implement practically until there are powerful and miniature mobile devices. 

19 Automobiles may be an exception, as may be technology which enforces the disclosure of 

20 the physical presence of a user agent (resident in a device) to other user agents within the 

21 networked environment. 

22 Continuation of the Addition to Autonomous Information Disclosure to Third 

23 Parties 
24 

25 The above description suggests the use of a statistical model which can take 

26 implicit passive data about the complete set of variables surrounding a particular instance 

27 in order to determine whether, and if so, what particular data the user is most likely to 

28 wish to disclose, such as the identity of the individual, the profile of the individual (e.g., 

29 what other users s/he is most similar to that have also interacted with the user), the time, 

30 location, inferred activities and content based attention focus (which may utilize the 

3 1 above variables in addition to passive monitoring of spoken content and facts surrounding 
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1 conversations). A few concrete examples may be worthwhile suggesting .The system, for 

2 example, determines that a physical introduction between two individuals may be of 

3 potential interest to them and compatible with their schedules. In fact, they are able to 

4 briefly connect (perhaps pseudonymously initially). Mr. A suggests to Mr. B that a lunch 

5 meeting may be a potential interest (assume that at least one of their calendering agents 

6 does not have them committed to a fixed schedule prior to lunch time and one of these 

7 individuals is Mr. B). Mr. B replies also indicating a potential interest, however, that he 

8 has recently dropped in at a friend's house and wishes to stay at least 20 minutes longer. 

9 Based upon the schedule of Mr. A, SDI receives data from the navigational agents of Mr. 

1 0 A and Mr. B which take into account the distances, drive time (based upon the preferred 

1 1 routes, the average speeds which Mr. A and Mr. B tend to travel for similar types of 

12 interests, and the cafe or restaurant which is closest to the point at which they rather are 

13 likely to cross. Mr. B, may state the constraint to his agent, i.e., the 20 minute minimum 

14 period prior to departure. Or the agent may interpret the accurate interpretation of this 

1 5 fact from his/her conversation with Mr. A. Or if the meaning or the words are unclear, it 

16 could even automatically reiterate its interpretation. The system may tell user Mr. B that 

17 in order to meet up with Mr. A (based on Mr. A's schedule) he must leave his friend's 

18 house in 10 minutes and forego the regular stop at the convenience store. Mr. C may be 

19 another potentially interesting individual for lunch, however, Mr. B's agent doesn't know 

20 all of the facts about how Mr. B may be willing to conform his schedule to meet Mr. C 

21 with regards to changing or rearranging a certain portion of his schedule, and/or replacing 

22 this portion of the schedule with another activity or engagement (an original activity 

23 which may now be in conflict due to the drive home and physical direction to visit Mr. C. 

24 Mr. B's agent thus identifies and conveys to Mr. B the salient changes to his schedule and 

25 the travel route within this possible alternative plan to meet Mr. C instead. Thus before 

26 Mr. A commits to Mr. B about lunch he has details about all of his feasible options for a 

27 lunch engagement. Mr. B may even state to his agent that that stop at the convenience 

28 store is a relatively fixed constraint (that it is important) and given that constraint what 

29 are the available options for a lunch meeting as such. 
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1 Note (as suggested) that at any time the system identifies within a certain 

2 probability threshold for that user that there may be a certain engagement (or other 

3 activity) which could be scheduled (or replace another schedule activity) the system: 
4 

5 1 ). Determines the predicted interest-level of that potential engagement or activity (or if it 

6 may involve replacing more than one activity/engagement, determine that the predicted 

7 interest of the combination thereof and/or it may enable other activities/engagements to 

8 be scheduled which may be replacements for others (estimate the predicted interest of the 

9 activities/engagements collectively comparing one scenario to the other). There may be 

10 other factors affecting user interest in one scenario versus the other, e.g. the distance or 

11 travel time with which it is associated. The system may also be uncertain regarding 

1 2 certain important determining variables, e. g. the profile of the user or even the present 

13 mood or activity or contents focus is not adequately determined, or the nature of the 

14 proposed activity or engagement is not sufficiently constructed (or robust) within the 

1 5 user's profile. In such cases, the user's agent can also ask the user direct questions in this 

1 6 regard using a national language speech interface and (ideally) at the opportune moments 

17 in which the determination of certain facts as these within the user's present user profile 

1 8 would provide the ability to identify or dismiss for the user certain opportunities as they 

1 9 occur. 

20 In light of the present context of the situation and potential opportunities, the 

21 questions may be selectively presented in order of priority and are typically related to the 

22 attributes characterizing present opportunities/individuals which are either of predicted 

23 interest or of unknown predicted interest/relevance but may possibly also include 

24 potentially predicted relevant questions about the present frame of mind of the user, such 

25 as what the user's present activities are (i .e., including work , leisure, etc.) mood, present 

26 focus on what kind(s) of content, etc. 

27 In the above example, of course, Mr. A and Mr. B could instead be established 

28 friends or colleagues in which case keeping secret their present physical locations may be 

29 considerably less important but not necessarily so. Revealing informational details about 

30 location, action, attention focus to another third party while completely leveraging this 

3 1 information to provide optimal scheduling efficiency is all possible with SDI and many 
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1 individuals may wish to not disclose it in this way (as a default) even if the disclosees are 

2 relatively trustworthy. 

3 Complete robust functionality coupled with complete user privacy as provided 

4 above through the user's agent provides a compelling motivation for users to maintain 

5 much of the information, regarding their present and general location and activity related 

6 data, private in the majority of cases of user interaction and associated schedule 

7 coordination. If Mr. B were to pose different relevant queries to Mr. A's agent while 

8 determining whether he could be interested in meeting, for example, where/when their 

9 paths may cross based upon different times and/or points of departure, the specific 

10 location and travel information regarding Mr. A. may still be fuzzed as the meeting time 

1 1 at the associated place of rendezvous does not disclose to the other party when the other 

12 will arrive, only that time when both will be there at the same time(for those rare cases 

13 where even this information may disclose likely specific location and intended time of 

14 departure, we can apply randomization techniques in order to assure privacy with 

1 5 relatively minor negative impact upon scheduling efficiency. Nevertheless, how much 

16 privacy versus efficiency is desired is entirely within the control of the user. 

17 In another application of the system, the individual may wish to allow another 

1 8 user (or entity) to be able to arrive at certain levels of deductions or conclusions about the 

19 user which may concern them without explicitly revealing any facts or details used to 

20 arrive at those conclusions. Or in a variation, even allow a continuous persistent 

21 revelation of information regarding these certain conclusions such that if a certain 

22 conclusion(s) occurs (or ceases to occur) notify the requester of such facts. A statistics 

23 and/or manual rules-based approach could be used in this instance. A statistical interface 

24 provided to SDIs central data warehouse could enable an expert to establish correlations 

25 and confidence thresholds appropriate for deducing certain conclusions. For more novel 

26 or complex rules which haven't been seen before SDI could receive disclosure from users 

27 regarding their actions and a request as to desired types of conclusions. The industry 

28 could further trigger an anonymous action (and/or notification) on behalf of the user or 

29 depending on the privacy policy (consent of the other party) an action could even be 

30 triggered based upon such conclusion in the absence of disclosure of the conclusion 

3 1 and/or even the autonomous action taken. 
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1 E.G. notify the subject user of interest in message (or suggestion or warning) 

2 another relevant, noting third party, etc. In such cases, it may however be acceptable for 

3 the user to be able to receive fuzzed aggregate statistics regarding such data as a user's 

4 past behavior patterns (or more confidentially) other users who share similarities to the 

5 user regarding his/her user profile and/or perhaps context of similar activities and 

6 circumstances. User profile could be more general characteristics of the user than the 

7 detailed user profile as suggested herein, e.g., the user's corporate department peers, 

8 class, demographics, psychographics, travel venue or combination thereof. This process 

9 may be performed in either a pull or even push mode. Many associated applications are 

10 conceivable, for example, a spouse (the requester) may desire to have his/her agent 

1 1 persistently issued a resolution credential indicating that all of the activities of the user 

1 2 while on a business trip are consistent with behavior which is appropriate for a married 

13 individual, e.g., that the associations with business associates/colleagues are maintained 

14 at a professional level (per that mutual physical locations and perhaps even directly or 

15 transmitted spoken or written communications) or that user doesn't engage in late night 

16 carousing at questionable venues, or , for example, that the individual on his way to 

1 7 work, to a meeting worked diligently on an important proposal or on his/her way home 

18 from work without revealing any further details which his/her agent has explicit 

1 9 knowledge of with which it deduces these facts with confidence. Another example might 

20 involve the use of LEIA via small cellular transmitters (with or without GPS) affixed to 

21 small children or the elderly whereby location and activity related information is 

22 maintained private, unless an inappropriate action is performed (e.g., motion and/or 

23 acoustic detectors in the elderly person's home suggest s/he may have fallen, that the 

24 child has wandered away from parents or is conversing with an inappropriate stranger. In 

25 such cases, notify the guardian or assistant in charge or allow a highly trusted party who 

26 happens to be in the immediate vicinity of the party to be notified to come to his/her aid. 

27 In the latter case (of the child) perhaps there is suspicion that the child has just 

28 disappeared and the parent(s) authorizing the agent of the child is able to send out an 

29 immediate description of the child to all certified "trustworthy" parties who are in the 

30 immediate vicinity, e.g., after the initial description has been sent out, one of the parties 

3 1 indicates the child was speaking or walking with an apparent stranger of X description 
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1 which is also, in turn, transmitted to these trusted local parties (and at that point to the 

2 police). In a variation of the above example, a crime could have just been committed and 

3 the assailant has just been described in detail by the victims with information regarding 

4 location and direction last headed which is again transmitted to other parties: 

5 Thus, more immediate, complete and up to date information of importance for 

6 police to make a more immediate apprehension of the suspect is made available to police. 

7 INSERT»»»»»»> 

8 Location determination via LEIA could be either through user's LEIA-enabled devices or 

9 the coordinates of the user(s) as identified verbally by the user, e.g., to a 911 server. 

10 Other examples of the present system could apply to immediately locating trained 

1 1 medical, EMS trained or rescue personnel in proximity of a urgent medical emergency or 

12 accident Such personnel could even be off-duty. Depending upon the particular nature 

13 of a medical emergency, particular specialists appropriate to the circumstances could be 

14 identified, their comparative physical proximities compared with other potential qualified 

15 individuals and based upon qualification appropriateness, physical distance and perhaps 

16 willingness, a most appropriate match made. In another application, likely witnesses to 

17 an accident (including automobile accident), theft abduction based upon their location 

1 8 and time relation to the event of interest could be notified to provide helpful clues in 

19 assisting investigators in obtaining a suspect or legal professionals a conviction or 

20 acquittal. In the case of abductions timing in acquisition of such information could be 

21 extremely critical and the ability of such notification to be distributed instantly to all 

22 "trusted" individuals in the immediate vicinity at the very outset of when a person (e.g., 

23 child) is even suspected of being lost could prove to be invaluable in not only rapid 

24 recovery but also in thwarting of those few instances in which a kidnapping has actually 

25 occurred. Some users may wish to be available and willingly notified to help in certain 

26 dire circumstances (the nature of which the user ultimately has control over) but may 

27 wish to remain anonymous to authorities and legal professional after the fact (or for 

28 example in cases of providing witness testimony (as via a subpoena) to more trivial or 

29 non-life threatening cases. A market model may also be created in which compensation 

30 schemes which are just appropriate to compel typical individuals to provide desired 

31 assistance may be provided. Finally, the present methodology could be readily extended 
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1 to a taxi dispatching service in which a taxi's most local to a given user can be dispatched 

2 accordingly. Another example includes employees within a large organization. The 

3 statistical techniques as described in the parent patent application could be trained such 

4 that examples can be identified or explicitly stated rules provided by employers of what is 

5 considered to be actions and behaviors which are inappropriate and irrelevant to the type 

6 of work related activities of that employee's job description (ideally trained across 

7 previous "similar" employees). The system can take into account browsing, 

8 communications, (including spoken communications), even location (LEIA) data into the 

9 model (e.g., for tele-commuters) and flag suspicious behavior. SDI can even upon 

10 agreement by the employee) allow the employer to access more detailed information on 

1 1 the specific behavior once the suspicion threshold has been exceeded. 

12 In another application, the user may wish to provide predefined rule( which could 

13 be suggested by the system initially in (similar fashions as suggested above). s which can 

14 be used to notify a certain individual(s) if certain explicit (or agent) inferred events occur, 

15 for example, that the user is in town, has just received certain important news of a 

16 promotion or change to a new position, internal company news which relates to that 

17 individual and may have professional relevance on the other individual (the latter two 

1 8 examples which may be relevant to say the professional profile of that individual , etc.). 

19 In addition, the individual(s) may be made aware of the event disclosure policies set forth 

20 by the user relating to them specifically (if desired) and this individual(s) can further 

2 1 provide filtering rules which allow all or a portion of that information to actually reach 

22 the user. In another example, the user may have met or known the individual at some 

23 time previously and the venue and context of the meeting circumstances and relevant user 

24 profile information to that context are known and disclosed by both parties. The user 

25 then wishes to re-establish contact perhaps defining the reason for the request and the 

26 individual can have predefined rules or acceptance or rejection of the request directly. 

27 In a final example, virtual tags are provided containing comments and annotations 

28 regarding a certain physical object (which could be anything from a tourist attraction to 

29 the physical site of a recent hot news event to a useful annotation critiquing a bricks and 

30 mortar vendor). An individual whose profile suggests s/he is knowledgeable (or had 

31 provided the relevant annotation of interest) about that object would allow this relevant 
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1 information to be conveyed to other individuals who have perhaps an interest in that 

2 information (generally or at that moment). 

3 The relevant individual (if s/he is also willing) may be contacted and solicited to 

4 meet if their calendaring agents can appropriately coordinate the meeting. It may be 

5 either context specific, e.g., both individuals are physically local (or plan to be local to 

6 each other) and the physical object or site of interest or simply (generally) local to each 

7 other. 

8 It is perhaps worthy to note that the present methodology could certainly be 

9 extended to include the application to rules dictating reachability conditions of users by 

1 0 other users via standard telephone. The application to cellular telephony is of particular 

11 interest in its ability to communicate to the user in a variety of modalities (voice, 

12 notification, instant messaging, Web content). In addition, similar rules may be adapted in 

1 3 this scenario with perhaps minor modifications. For example, under what conditions may 

14 the user be reached by another user. E.g., what is the nature of the relationship between 

1 5 those individuals? If it is not known, inferences from the context of previous conversations 

1 6 may be determined. Does the user typically accept calls (or other communications) from the 

1 7 other user under similar circumstances and/or times. If this data is not available are they 

1 8 accepted by other users similar to that user according to a key attribute(s) or does the other 

19 user accept communications from the user under similar circumstances as they present 

20 communication attempt (as this may suggest similar reciprocal acceptable reachability 

21 conditions as that of the other user. Again, LEI A suggesting location movements and 

22 associated measurable behaviors as sequential patterns and as formation of time may be 

23 important. This detailed statistical model of the user could be uiseful in learning the context 

24 of performing certain autonomous actions for the benefit and convenience of the user and in 

25 the reachability of the user by other users, e.g., in what communications mode is the user 

26 interested in communicating, e.g., speech to text-based instant messaging or standard 

27 telephony, does the user mind being interrupted with phone ringers, (or vibration mode) or 

28 ringer-off mode). Previous similar communications contexts may be useful in determining 

29 inferred rules which may be presently appropriate (e.g., was the spoken content of the 

30 previous communication indicative of acceptance or annoyance of the communication with 

31 the other user within a similar associated context . Eg., this may suggest whether and in 
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1 what contextual conditions of both accessor and accessee the accessor' s call may be patched 

2 through (or re-forwarded if initially accessing another number or telephone of another likely 

3 venue where the user may be believed to be present. 
4 

5 Exemplifications of The Underlying Statistical and Rule-Based Intelligence Used in 

6 Autonomous and Semi- Autonomous Release of Personal Data Release, Matching and 

7 Reachability by Other Users 
8 

9 The above section "Location Enhanced SDI System relating to Smart Home and 

10 Office techniques provides a relevant platform for integrating intelligence into a multi- 

1 1 device environment. Mobile user intelligence is a sub-component of this broader multi- 

12 device (or '"ubiquitous computing") application-level intelligence platform. In view of 

13 the present application framework, it is a very intriguing and challenging problem to 

14 provide a statistical framework which is able to allow the user agent to make appropriate 

1 5 inferences regarding users who generally, though imprecisely, fit a certain profile which 

16 may suggest the activation of a rule (i.e., a "fuzzy rule") such as requesting further 

17 information disclosure from that user, identifying that individual to the present user or 

1 8 more sensitive types of actions such as if the requested information from the other user 

19 requires some sort of a fee or the other user, in turn, requests certain information 

20 disclosure from the current user (which may be considered at least moderately sensitive). 

21 It is clearly extremely advantageous to entirely automate as many of these types of 

22 information exchange and notification related actions as possible in order to enable the 

23 system to function seamlessly and intuitively to the user's present wants and needs. In a 

24 general sense, it is reasonable to suggest that the key to minimizing active user approval 

25 of rule based recommendations is the ability to accurately infer present user actions and 

26 behavior, thus allowing the system to more accurately infer the present user mind set, 

27 activity related focus, perhaps even the mood which the user is presently experiencing 

28 unless the nature of the disclosure is extremely sensitive or otherwise extremely 

29 important to the user. Many of these types of actions could be performed without active 

30 user approval and authorization. There are a couple of approaches which can be applied 

31 which may be able to limit the uncertainly of the system, possibly alleviating before 
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1 hand, inappropriate or unwanted actions or failure to perform certain desired actions. For 

2 example, it might be possible to perform data analysis on very large data sets of users 

3 who share very similar profiles and to perform very similar actions and behaviors in all 

4 aspects of their physical interactions with the real world. The recommended actions 

5 provided by the system may also be used to provide the user with a natural language 

6 interface which identifies to the user those types of circumstances and conditions in 

7 which there is an inherent lack of adequate statistics from previous user experience to 

8 prove beyond a reasonable level of confidence the action desired by the user. Or the 

9 system is unable to identify with a high statistical confidence level a most appropriate 

1 0 action based on the collective statistics of other similar users and conditions (the system 

1 1 may helpfully suggest those which are possibly the most prevalent. However, in other 

1 2 cases the actions (or combinations thereof) of the user are unique enough that the system 

13 lacks sufficient statistical data, the likely appropriate actions are sharply divided between 

14 or among different possible actions or (similarly) there are no strong statistical 

1 5 correlations which can be established (perhaps because the key factors with which the 

16 desired actions are most influenced cannot be captured due to the limited input 

17 constraints to the present system, e.g., precise accounting of present activities or focus 

1 8 (even such things as the likely emotions or present attention focus of the individual for 

19 which there may be a lack of sufficient measurable clues visa- vie the suggested input 

20 modalities. The natural language interface (perhaps an application for a language 

21 generation system) could communicate to the user its interpretation of his/her behavior 

22 and the statistical correlations with recommended autonomous actions. E.g., if the user 

23 is in his/her home study between 9-1 1PM ? she/he is likely to be engaged in work relative 

24 to a research publication thus an invitation to other colleagues in this area will be 

25 extended and/or communication requests accepted .to contact the user accordingly under 

26 these conditions. The system may detect similar conditions in which the user may reject 

27 requests to correspond by such individuals under these conditions, e.g., if the identity or 

28 identified context (or even context inferred from the identity) is inappropriate for that 

29 particular context of user activity. The user may provide another annotation to the rule 

30 which says that if she/he has recently corresponded with another individual(s) at that 

31 time/location, chances are that she/he is, in fact, in a social related mind set. As 
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1 indicated, there are numerous potential inputs to the system, which could be considered 

2 in predicting what agent-mediated actions are, in fact, appropriate. The techniques of the 

3 parent patent application, if applied in this context, would allow us to make strong 

4 inferences about the particular mind set, reflecting present interests or preferences which 

5 the user is likely to be receptive to, presently based upon such clues as who the user is 

6 presently interacting with, the content profiles of the present real-time dialogues, e. g, 

7 typed or spoken through a communications network, (or simply passively collected off- 

8 line), the object profile of documents or web pages being interacted with. Activities may 

9 further be inferred by such indicators such as the particular physical location of the user, 

10 visa- vie LEIA, the particular sequence of location specific movement patterns, the 

1 1 particular types of devices with which the user is interacting )( and perhaps the nature 

12 and/or sequence of those interactions). More complex rules are certainly also 

13 conceivable, such, as the user will not interact, transact except under certain specific 

14 conditions, such as a commercial opportunity if the user fits a certain stereotype, i.e., 

15 similar to those previously interacted with for the nature or content of the proposed 

1 6 discussion, in light of the type of user, may be acceptable under certain circumstances, 

1 7 locations or mind sets experienced by the user. E.g., a young female in her twenties may 

1 8 not wish to be approached by 40 year old males unless the context of the interaction were 

19 purely business, e.g., a professional, or commercial or sales opportunity thus if she were 

20 in a purely social mindset such requests to interact would be denied. Likewise a busy 

21 executive in his/her 30' s may deny such requests from the young female unless her 

22 request was purely of a social nature and he was not overly time constrained. There may 

23 be certain "professionally" prospective correspondents, however, which he may prefer 

24 over others based upon some additional interesting criteria as well such as other 

25 business/professional areas of commonality, common interest areas, even socially 

26 compatible or interest criteria. 

27 Of course, SDI is able to extrapolate extremely sensitive features regarding 

28 interest preference and context which may establish a basis where two individuals or 

29 more individuals, to be mutually identified to one another and/or communicate 

30 accordingly or to not be mutually identified and/or accordingly communicate (SDI may 

3 1 even establish the basis for these mutually beneficial interactions even if revelation of 
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1 this basis by one or more parties by the other is not permitted within the privacy policy of 

2 one or both of the parties). Of course, SDI may frequently identify individuals whose 

3 personal data revelation requires certain minimal requested personal data from the other 

4 party as a condition to such revelation (which SDI is capable of fully automating on 

5 behalf of both parties) or the user may request certain information and make a personal 

6 judgment in real time as to whether further information about him/herself should be 

7 revealed and/or whether introduction or communication is desirable. In these cases it 

8 may be difficult to always successfully achieve an introduction when appropriate as the 

9 user is unable to guarantee 
10 

1 1 a). The revelation of certain information to the other use if that user reveals certain 

12 information accordingly, or 
13 

14 b). Is willing to agree to be introduced if certain desired information is revealed by the 

1 5 other party (often busy executives have absolutely no time to disrupt their busy schedules 

16 to discuss certain opportunities when there is enough revealed about those opportunities 

17 to justify the correspondence. However, the nature of the opportunity is too sensitive a 

18 piece of information for the disclosing party to be able to reveal without certain 

19 guarantees that there is sufficient interest for the recipient to discuss the potential 

20 opportunity further. This quandry unquestionably poses a huge challenge, however, it 

2 1 can be somewhat overcome by: 
22 

23 a). SDI';s ability to keep track of most, if not all activities, behaviors and 

24 correspondences, particularly acceptance or rejection of those SDI pre-qualified 

25 recommended interactions, 
26 

27 b). Detailed information provided by the individual or entity to SDI as to the nature of 

28 the various types of individuals, entities and potential opportunities to conduct business 

29 which the individual is able to provide to SDI. SDI itself may apply human experts for 

30 the task of interpreting the compatibility of certain requests with the access 

3 1 correspondence policy of the individual, provide a rules based system (which can also 
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1 learn with fuzzy rules) with which to automate some of the decision or, at least automate 

2 the "definitive" exclusions). 

3 13.2.2.2 Applications 

4 User and seller information - as detailed in the parent issued patent, specific details of 

5 what users and sellers may be looking to buy or sell respectively may be used to 

6 suggest the basis for a potential commercial transaction. The transaction may be 

7 large (but not necessarily so, e. g., real-estate, private investment in a small 

8 business or public stock). If a physical or on-line interaction with the other party 

9 is warranted (e. g., for larger commercial transactions), as is suggested later as in 

10 the present description, users may identify other users which form the most 

11 relevant "match" with their interest. At this point the agents can check for 

12 credentials, and then either communicate or calendar a meeting. Similarly, the 

13 agents may find the "best" match of users who happened to be physically 

14 proximal to the user at that particular time, or at some future time(s)/location(s) 

15 which is mutually compatible (similar applications are suggested for matching 

1 6 sales persons with prospective clients, identifying experts to work (individually or 

17 collaboratively) on a particular project or problem, to answer a question of an 

18 appropriate specialized nature to their area of expert knowledge.) The parent 

19 issued patent suggests at a general level these commercial applications. An 

20 additional feature described therein involves the use of a decision tree called 

21 "Rapid profiling" which can be used in the present context to identify from the 

22 most common needs of users and "goods" of sellers in general and the known 

23 profile data about each user and seller individually, a list of questions for each 

24 party which most briefly and efficiently determines the complete user/seller 

25 profile of each party individually. 
26 

27 Social Interests Profile Information — The parent issued patent also suggests the 

28 present application at a general level. For a dating application, users may be 

29 matched on the basis of their common interests/preferences and perhaps on the 

30 basis of certain information reflecting personality, social or cultural 
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1 behavior/affinities or psychological attributes. On the other hand, for purposes of 

2 meeting casual acquaintances, users may be interested in another user who shares 

3 the above characteristics as well as someone who has recently shared similar 

4 experiences and/or personal challenges. 
5 

6 Professional Information/Qualifications - As in the application of matching users and 

7 sellers, a description of a user's needs or situation with relation to various 

8 professional services may be provided as additional data about the user. 

9 Examples may include: (as above) medical data, professional or business history 

1 0 (as well as legal history) which may be of interest to law firms, accounting firms 

1 1 or various business consultants. Personal, family or emotional difficulties may be 

12 of interest to psychologists or family counselors. Again, users may submit this 

13 information as a query for prospective matches, or they may be pseudonymous 

14 queries or automatically matched in accordance with criteria specified by the 

15 professional. The issued parent patent application also lists additional 

16 applications, which could as well be relevant within the usage context of virtual 

1 7 tags. 
18 

19 Employer/Employee Information - An employer may post a description as part of 

20 his/her virtual tag (and that associated with his/her company). His/her employees 

21 may also have provided ratings and/or annotations, which are further descriptive 

22 of his/her personality, leadership/management style and skills, work environment 

23 which s/he promotes and overall quality. A previous employee may allow 

24 him/herself to be contacted by the prospective candidate (e.g., in exchange for a 

25 fee). 
26 

27 Access Privileges Information - Users in an organization are frequently given 

28 privileged access to certain files within a corporate intranet but not others. 

29 Though there are many ways of profiling users according to their level of access 

30 privileges to information, the following example is considered: Based upon the 

31 position (e.g., responsibilities and tenure with the organization), users may be 
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1 "classified" into groups according to different levels of access to confidential 

2 information. Virtual tags may be used to extend the capability by providing for 

3 immediate disclosure of a user's information access privileges to another 

4 employee in real-time and in a physical context. Also, if a user reads or accesses 

5 certain information, meets with a certain colleague or friend, then a user might 

6 send a message X. This message could be (for e.g.) a request to perform some 

7 task relating to part of that information, a reminder to address certain issue(s) 

8 while chatting with the colleague etc. or, per the request of an individual's 

9 employer or colleague if a given individual (a sales person) meets with user X 

1 0 send him/her message Y (which may refer to a previous encounter, experience or 

1 1 fact s/he should know pertaining to user X and which may have bearing upon 

12 their conversation or professional interaction. 
13 

14 Access restricted physical areas. In this case the virtual tag effectively may behave 

15 like an "electronic door key". A variation of the technique may be used for 

16 granting access to professional meetings, where information access privileges of 

17 users must match the anticipated confidentiality parameters for the scheduled 

18 meeting. Another application may include the ability to automatically enable 

19 access or restrict access, based on payment of fees, and whether or not an 

20 individual is a representative or partner of a competing company. 

21 13.2.2.3 Business Introduction System 

22 Another application domain for privacy-protected match-making, where users are 

23 anonymous until an agreement is struck is business to business introductions. For example, 

24 it might be useful to automatically identify synergies between businesses (e.g. in 

25 infrastructure, technology, or product) ~ for the purposes of pursuing an advantageous 

26 strategic relationship. If the meeting is between two employees of competing companies, 

27 then the system of match making could also ensure that a meeting is predicated on a 

28 particular task that does not cause conflicts with their respective companies. 

29 Each business might state within SDI: 1) what entities or types of entities s/he would 

30 share resources with. 2) If so, on a per-entity or per-entity type basis, what types of their 
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1 resources (e.g. type of skilled employee and for what TYPE of out-sourced task) would the 

2 entity share. It is an obvious extension to look at sharing of code, technology, intellectual 

3 property. A major challenge and limiting factor being how well informed SDI, the neutral 

4 intermediary can be made aware of the needs/requirements of a company such that it can 

5 make evaluations entirely on its own regarding highly confidential materials with which it 

6 can accurately predict the basis for a deal WITHOUT disclosing to the prospective recipient 

7 what the technology or know-how entails (which could compromise the value of that asset 

8 should a deal not eventuate). 

9 Corporations may also use SDI to utilize the above information regarding their 

10 human and technology sharing synergies in order to detect and recommend strategic (e.g. 

1 1 equity sharing, merger, acquisition etc.) relationship opportunities between the entities. B to 

12 b and even b to c user centric SDI user behavior analysis combined with text analysis 

13 should also provide revealing clues about what types of companies tend to share similar 

1 4 customers and provide similar (complementary or competitive) products and services which 

1 5 may suggest that such synergies are potentially available. Of course the vendor centric SDI 

16 service may benefit from this information as well as provide information for other vendor 

17 identification of competition. Other data resources including electric payment protocols, 

1 8 EDI, automatic check payment, check services, etc. may be useful data resources as well. 

19 Again the disclosure of detailed business information is very helpful and a data 

20 release policy defining the parameters for such strategic initiatives may be critical in order to 

21 determine what companies may be potential candidates for which initial feelers (of high 

22 level information disclosure) would be appropriate to put out to a prospective company to 

23 determine mutual interest and/or further basis for expected synergies. 
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1 13.2.2.4 Dynamic Annotation/Information Filtering 

2 In this extended application of SDI, we allow users and other third parties to 

3 annotate objects (physical and virtual) with meta-information, either to remind 

4 themselves about a previous interaction in the future — or as a system of "knowledge 

5 learning", where systems of users leave useful information for other users. Information is 

6 left in the environment, leaving a trail for other users. 

7 For example, the information that is tagged to an object, referred to as a "virtual 

8 tag", can contain a pointer to other relevant information, such as a survey of a film by a third 

9 party, or the user's own comments/feedback. For example, a restaurant listing could be 

10 annotated with meta-information about the quality of the food and service. Such 

1 1 information, when provided by a wide sample of users, can provide robust information 

1 2 about objects. The information that is used by a particular user can be filtered — for example, 

1 3 weighting the opinion of a respected restaurant critic, or weighting the opinion of users with 

1 4 common profiles (when that information is available). 

15 Virtual tags (i.e. meta-information) can be assigned to objects with physical 

1 6 locations, and the information triggered based on the physical location of a user (using LEIA 

17 technology). Virtual tags can be assigned with expiration dates or other time-sensitive 

1 8 information. An individual user might leave an "action item", for example — next time I 

19 return to this object (e.g. web page/ vendor) be sure to perform this task, enter this query, 

20 check this link for new information. As another example, after a conversation with an SDI- 

2 1 enabled user it is possible to tag that user with some notes, to remember the conversation the 

22 next time the two users meet. 

23 The technical innovation that allows this use of virtual tags, in addition to the 

24 protection of privacy, is that we allow users to annotate information to objects that they do 

25 not directly own through a system that separates virtual tags from the content that is tagged. 

26 In particular, tags can be stored (either at the ISP-level proxy, or main SDI server) for 

27 associated web pages, and exchanged/retrieved automatically when the object is accessed. 

28 The virtual tags can be used in conjunction with target-object profiles that are generated 

29 through SDI for web pages (and approved by vendors). Virtual tags can be searched, using 
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1 relevant terms, locations, or times, and can also contain links to authoritative information, 

2 such as audio and/or video. 

3 For example, in accordance with the prediction that readily deployable visual video 

4 recording devices will become commonplace (even integrated into wearable computing 

5 hardware). Users who are amenable to releasing such information under terms of their 

6 privacy policy may allow other individuals subsequently visiting the same physical location 

7 physically or virtually to gain access to such information. Such information may range from 

8 detailed accounts, assessments of value, etc. A user could, for example, do a general query 

9 about locations e.g., what is the percentage breakdown of types by their pseudonymous 

10 attributes, who frequently visit this site which sites tend to have commentary of a particular 

1 1 topical nature of interest to user U, which sites are visited frequently by individuals similar 

12 to user U, which sites have had an event of a particular type or one which is similar to user 

13 U's user profile occur in the recent past (or where or how close did such event occur with 

1 4 respect to the recent location of user U and so forth). 

1 5 Tags are encrypted, so that only SDI-enabled users can access them. Tags are also 

1 6 associated with the pseudonymous ID of the user that left the information (although they can 

17 be anonymous, an associated profile allows more accurate collaborative filtering 

1 8 techniques). Finally, users can leave data-disclosure policies, embedded into tags — to certify 

19 the properties of other users necessary to release the information. When tags automatically 

20 are time-stamped with location, and time, and other information we allow for this 

21 information to be "fuzzed", as disclosed in the section on Randomized Aggregates, to 

22 protect a user's identity. 

23 In the physical world, implementation of meta-information in a user's physical 

24 information, can be viewed via head-up displays, video cam monitors, wearable computing 

25 devices, or audio pieces. The information itself can be embedded directly on physical 

26 objects, for example on magnetic strips or via. visual encoding techniques — or the 

27 appropriate information can be accessed from a secure remote database based on the user's 

28 physical location (using LEIA location technology); or bar-codes that provide a universal 

29 identifier for an object. 

30 As an extension to this model, we also allow users, vendors, and other third parties 

3 1 to associate "meta-information" with other users and vendors. This information might be a 
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1 user's opinion about his/her interaction with another user, an annotation that relates to a 

2 particular web page, or information about a physical object. The system of SDI enhances the 

3 value of this information by providing a secure environment where users can report meta- 

4 information (i.e. opinions) along with their profile information, to permit collaborative 

5 filtering techniques to generate appropriate meta-information about an object (user, physical 

6 object, vendor, web page, etc.) that will be useful to a particular user — given that user's own 

7 profile. We define 'Virtual tags" as any piece of information about an object (physical or 

8 virtual). The information may be authored by any party, but annotated accordingly. For 

9 example, the appropriate virtual tag provided by a user about his/her-self is the 

10 pseudonymous profile for that user, and with SDI only the user his/her-self can gain 

11 access to the profile (either directly through editing, or indirectly through continuing 

12 transactions). 

13 We might implement a Kasbah-style "reputation system" within such a virtual 

14 community. Initially users (under pseudonyms) have no reputation, and their opinion does 

1 5 not count for much, but after every positive interaction (as defined by other parties in an 

16 interaction), the "reputation" of a user can increase, (see the Kasbah system, MIT) [Kasbah 

17 98] This reputation system is appropriate to a pseudonymous environment. Notice that 

1 8 gaining negative reputations is not useful when users can simply change identities. In one 

1 9 variation we can "block" certain users from providing information, when those users have 

20 negative reputations. Clearly, collaborative filtering or other data mining techniques could 

2 1 usefully allow for reputations when weighting information about an object. 

22 13.2.2.5 Meeting Planning 

23 The general application framework of matching users based on mutually beneficial 

24 and acceptable terms and conditions within a physical context, e.g., be it e commerce (as 

25 above described) business or social objectives, it is also useful to develop a mode which 

26 considers the schedules of the calendaring agents of the individuals, as well a. s the timing 

27 and (if relevant) the meeting opportunities which would be based upon mutually compatible 

28 meeting venues which are mutually time compatible). A statistical model could be 

29 developed which would take the above data (where appropriate) the geographic proximities 
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1 (considering lengths of travel routes between users may travel to attend a meeting) in order 

2 to determine: 

3 a. How to minimize the travel distance for individuals whose automatically scheduled 

4 meetings are combined as part of an existing travel activity might be if the meeting was 

5 scheduled, in part because it was of relevant values to the party (and it was at least relatively 

6 close to the existing intended travel route). 

7 b. It was sufficiently important to mandate a separate trip. The idea is to determine the 

8 optimal physical locations of individuals to physically reside, (i.e. for residents or even hotel 

9 lodging as well as possible the physical locations of business such that when the above user 

10 agents - agent introduction meeting and scheduling functionalities for physical meetings is 

11 performed for the amount of physical distance which must be traversed is optimally 

12 optimized. The statistical method used in this calculation would be similar to a scheduling 

13 optimization method used to schedule flights and associated routes to minimize the total 

14 distance traveled by all flight routes collectively (see Princeton publication using genetic 

15 computing techniques). Of course, the above approach would be used for a more 

16 immediately practical application in which the objective would be to simply reduce the 

1 7 amount of distance a user traveled for his/her daily activities and to tie this framework into 

18 an existing scheduling system which is appropriate for the user. It is possible to also further 

19 define the ultimate objective to be reducing the actual travel time which a user spends in a 

20 vehicle (thus the optimization algorithm would further factor in the effect of time delays due 

21 to highway congestion which would affect the ultimate ideal locations of these physical 

22 sites. 

23 A couple of concrete examples includes for example a real estate developer who 

24 develops clusters of home building sites or town homes which reflects the profiles of users 

25 sharing common or compatible profiles (e.g., socially, professionally or commercially) or 

26 recommending hotel lodging sites for individuals who share similar commonalties and 

27 happen to be visiting the same location or city. It is also possible to physically locate 

28 vendors at locations in which their most preferred customer prospects are most commonly 

29 physically traveling past or are physically situated (using LEIA) even considering where 

30 these prospects are during periods in which their shopper interest (or mood) for those 

3 1 items/services are heightened (or more generally factoring in both location and temporal 
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1 (mood) factors into the user profile data as it is herein applied for this purpose. Whether 

2 users or vendors (subject to user consent) share customer data, there are other potentially 

3 interesting and relevant applications, for example, based upon LEI A a user's calendar 

4 schedule perhaps even verbal clues, it may be possible to coordinate meeting between users 

5 and professionals who can provide a useful service e.g., if a specialty physician happens to 

6 be situated in the user's present or anticipated locality and the physician would be available 

7 for an appointment at that time and location. It is possible that ad hoc use of shared clinic 

8 space may be available for such ad hoc appointments. 

9 1 3.2.2.6 Investment Matchmaking/Venture Capital 

10 This section describes a market place in which start-ups can propose a venture or 

1 1 other financing needs in order to elicit an auction between different Venture Capital funds. 

12 We can allow potential investors to leverage securely confidential information regarding the 

13 details of business present and anticipated strategic alliances and customers. Each company 

14 securely registers all of this confidential business information SDL It is even possible for 

1 5 confidential information about future R&D initiatives proprietary know how and intellectual 

16 property to be entrusted with SDI such that potential synergies may be determined well in 

1 7 advance of the market. 

18 A domain expert(s) within SDI then determines potential synergy's between the 

19 various commercial entities both for purposes of facilitating introductions on a 

20 customer/vendor level, strategic partner level, as well as what particular financing sources 

2 1 which are compatible for the level of the financing needs and other characteristics of the 

22 financing such as terms, involvement on a control and/or management level (as well as 

23 compatible commercial technology venue in which they participate). The key objective is to 

24 identify as first priority the commercial entities and customers which can provide the 

25 greatest degree of benefit to the vendor then to, secondly, determine which entities within 

26 the VC/financing community are able to bring the most valuable of these contacts to the 

27 table. 

28 An investor may fund a small start-up and also improve the value of the start-up by 

29 facilitating a larger vendor becoming a customer. An investor might negotiate a special deal 

30 for such a vendor. In one variation SDI could be used to mediate the introduction of 
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1 optimally strategic investment opportunities to an investor and to identify strategic synergies 

2 among and between different commercial entities who are already funded by an investor(s). 

3 In this way it may be possible to introduce these investors and create joint participation with 

4 the new prospective investor. 

5 By leveraging SDI, the investors may even identify certain intriguing facts such as 

6 that the synergies between one of their own customers and the other investor are also 

7 significant and/or even dis-synergies exist between their own investors and themselves 

8 which do not exist with other investors (e.g. and investment in a competitor or customer of a 

9 competitor) that it would be mutually advantageous for the investors to entrust SDI with the 

1 0 job of making appropriate introductions for re-distributing equity ownership of the investors 

1 1 into those other companies which are more appropriate from a strategic standpoint. 

12 The motivation behind agreeing to these introductions of course must be bona fide, serious 

13 and not initiated with the malicious intent to capture otherwise confidential information 

14 (particularly for those investors at higher level financing levels such as higher level VCs, 

1 5 investment bankers and more generally investors who tend to be more minimally involved 

16 in direct management or marketing initiatives for the company) it is certainly reasonable to 

17 imagine that some of these investors may wish to apply some of the above suggested 

1 8 techniques for finding (or even exchange equity opportunities within the commercial market 

1 9 partially for the purpose of diversifying their risk by thus owning smaller equity stakes in 

20 more companies and particularly those which are most commercially strategic and mutually 

2 1 synergistic in nature. 

22 The powerful data collections of SDI regarding comprehensive 

23 commercial/investment data which of relevance to predicting the nature of an market 

24 dynamics of the associated commercial entities, and to build insurance for an investor. The 

25 primary objective is to identify investments which are predicted (using a variety of methods 

26 both human intuition and empirically based) to behave very similarly (including e.g. 

27 possessing a very similar, upside and risk potential). A secondary objective (unlike the 

28 above variation) is to then evaluate and actually identify competing companies to the one(s) 

29 the investment in which is being insured where it is believed that the success of the 

30 competitor(s) would have a negative impact on that of the company. Thus risk is effectively 

3 1 diversified among the overall success of those companies. The insurance could include a 
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1 deductible and premiums which are determined in accordance with the amount of coverage 

2 which the investor wishes to acquire. These premiums could be either paid in the form of 

3 cash or equity. In one variation "success" for insurance purposed could be measured as the 

4 success relative to the average of all of the others within the same similar group. 

5 13.2.2.7 Ideas Market 

6 Individuals can submit an idea to SDI along with the application framework, target 

7 market, or a list of vendors, which may most benefit as a result. Interested parties can then 

8 bid within SDI for the ideas. The system of SDI can operate the auction privately^ and only 

9 provide bidders with a limited amount of information about the idea. The problem is to 

10 automatically appraise the value of an idea for a particular vendor without providing the 

1 1 vendor with the idea, so that when the auction is over only the agent that wins actually 

12 learns detailed information about the idea. The price paid may include an up-front price and 

1 3 promises of long-term royalties or equity, etc. 

14 The privacy-secured ideas market is useful, because traditional negotiation processes 

15 fail: there is always a powerful motivation on the part of a commercial entity to steal the 

1 6 idea and leaving the originator with no compensation, consideration or acknowledgement. 

17 With SDI an individual with an idea is able to place an idea in the marketplace, so that 

1 8 commercial vendors can provide SDI with information regarding their business, to allow 

19 SDI to value the idea autonomously on behalf of the vendor; i.e. on the basis of current and 

20 future commercial research initiatives. SDI computes the estimated requirements and 

2 1 economic value that any given submitted idea would have towards that vendor. 

22 Human experts may play a role in evaluating the value of an idea to a vendor. In a 

23 busy idea market there is the need for a scheme by which ideas are automatically routed to 

24 the most appropriate individual experts to evaluate that idea, which is either within SDI or 

25 within the organization which most likely needs the idea. This routing scheme could be 

26 based partially upon attributes associated with experts and features extracted from the 

27 description of an idea. 
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1 13.2.2.8 Negotiation Intermediation 

2 Given the above general application framework for SDI in which commercial 

3 entities can securely entrust to SDI with their commercial marketing and technology related 

4 focus and objectives, it is a reasonable extension to further extend this data Rich 

5 framework to enable the above described exchange of confidential information among and 

6 between commercial entities to occur not merely in a purely general and non-dynamic 

7 fashion but to additionally incorporate timely information which may represent time 

8 sensitive and critical decision processes which are in the process of occurring at that 

9 particular time and which may be relevant to the mutual strategic discussions and 

10 negotiations between the entities. A very important application/example of this approach is 

11 the fact that commercial entities may be in the process of negotiation with another 

12 commercial entity regarding particular commercial or technology related opportunity very 

1 3 often (the majority of instances) the nature of these discussions and particularly the identity 

14 (identities) is maintained as strictly confidential information throughout the discourse of 

15 these negotiations. Though the following solution may not be appropriate for all cases of 

16 confidential discussions and negotiations there may be particular instances in which a 

1 7 knowledgeable "expert" within SDI could be confidentially kept apprised of the details of 

1 8 the negotiations as well as progress and any obstacles as they occur. Of course, SDI and the 

19 expert have additional knowledge about the general and temporally specific needs and 

20 objectives of other third parties who may be potentially interested in the nature of the deal in 

21 progress. 

22 At a general level there may be the opportunity for the expert to provide high level 

23 probing questions to the appropriate parties or decision makers within the other commercial 

24 entities which certainly do not reveal any identifying information about the negotiating 

25 parties, do not reveal any proprietary technical details which would compromise the 

26 proprietary nature of this information and enable the identity (identities)of the parties to be 

27 indirectly inferred in this regard. However, business terms and perceived value exchange 

28 may be thus conveyed as limited facts in the abstract or possibly in more detail depending 

29 on the particular situation. There may also be particular critical junctures and impasses in 

30 the negotiation at which point the vendor may be compelled to concede. These points may 
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1 be critical points at which it may be opportune for SDI to initiate or go into a deeper level of 

2 investigation with the other parties regarding the critical issues at hand. The above 

3 precedent may even be replicated among all other third parties, which SDI deems to be 

4 potentially receptive and legible to the prospective deal. This process is analogous to time 

5 of purchase competition in that ifwhen another potential deal is identified from another 

6 entity which is more desirable the original negotiating party may be informed (as well as 

7 possibly the other relevant parties). This process may be reiterated as well. 

8 1 3.2.3. A Real Time Experts Market 

9 In this section we suggest a market for expert opinions, in which users with an 

10 information-need are matched in real-time with "experts", that are prepared to provide 

1 1 information and opinions in return for a payment. This economic approach can be used for 

12 example, if a vendor would like a user to provide feedback about its products and/or 

1 3 services: the user becomes an "expert" and can receive payment in terms of discounts in the 

14 future. The information in SDI can be used for the automated selection and user targeting 

1 5 of tasks, based upon profile information. 

16 We can allow users to be identified by another individual in which there are 

1 7 mutually beneficial opportunities for both parties to interact and terms/conditions for the 

18 disclosure to the other party is defined within the user's data disclosure policy. For 

19 example, issued patent 6,029,195 Herz, et al System for Customized Electronic 

20 Identification of Desirable Objects describes a "system methodology by which users are 

21 able to find a knowledge domain expert to answer a query, deliver personalized advice for a 

22 particular issue or problem to which they are extremely knowledgeable about, and provide 

23 references to other information sources. 

24 The parent issued patent U.S Patent # 5754939 describes techniques for 

25 identifying experts on a communication network based upon their profiles and the search 

26 profile of a requester. In the present methodology we further provide an economic 

27 mechanism to encourage well qualified experts to provide options, and find experts. 

28 There is currently within the prior art economic incentive schemes which can be adapted 

29 and effectively employed in this case in which it is possible to reward very well qualified 

30 (and truthful) opinions on a very broad range of issues and domains in which the 
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1 accuracy of the ultimate outcome of a particular opinion is directly rewarded. This 

2 mechanism effectively insures the quality of the referrals/recommendations and is 

3 considerably more efficient then immediate payments for referrals where quality and 

4 truthfulness of the expert advice cannot be audited prior to payment, thus creating a 

5 disincentive to provide truly "valuable" advice. 

6 The system of SDI can forward requests for information to experts within the same 
1 system as time-of-purchase competition, SDI acts as a CLEARING HOUSE for requests for 

8 information, and experts can bid to provide responses. The system of SDI can also help 

9 users to choose between experts, based on proficiency profiles that are derived from the 

10 value of information that experts have provided to other users. We can allow experts to bid 

1 1 for the right to annotate and provide ratings for particular pieces of information, products, 

12 services, etc. 

13 In a web annotation example, as users provide annotations and recommend links, 

14 and other users provide feedback about annotations, then the system of SDI can build a 

1 5 "proficiency profile" for a user, to indicate the ability of a user to add value to the browsing 

1 6 experiences of other users. The information retrieval and document clustering methods as 

17 taught within the parent case #57549398 provide a statistically sound methodology to 

18 develop a user profile that predicts the "proficiency" of a user to provide recommendations 

1 9 about objects that fall in particular clusters. 

20 In the economic variation we can allow agents to bid for the right to receive high 

21 quality recommendations, at least in the case where recommendations are new and cannot 

22 be duplication on the basis of current recommendations. We can create an "experts market", 

23 where experts are assessed on their ability to provide quality to previous agents, as noted by 

24 those agents. SDI could also suggest a list of content categories that an expert might 

25 consider operating within. We can also apply the rapid profiling techniques in the parent 

26 case to assess a user's expertise based on his/her response to a set of questions. A rapid 

27 profiling tree is essentially a decision tree, which can be used to present a sequence of items 

28 to which a user "responds". The rapid profiling method enables a user profile to be 

29 constructed in the shortest possible sequence of inquiries to the user. In this case, the 

30 response is the user's recommendation links (which are then judged). With the benefit of a 
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1 comprehensive proficiency profile of the user the system may identify and present items to 

2 the user automatically (if s/he is amenable) for which the user is predicted to be proficient. 

3 We may pay individuals based on bonus points or coupons with future redeemable value 

4 across a network of vendor sites (e.g., within that particular SDI recommended alliance of 

5 vendor sites which may be dynamically generated for each individual user). Experts that 

6 include links to pages could be paid for the value of increased value based on those links. Of 

7 course SDI can continue to ensure that users are not motivated to provide biased links 

8 because of this type of financial incentive, continuing to monitor how useful other users find 

9 the links and recommendations. 

10 The market for experts and real-time information filtering can be applied to new 

1 1 content. Via collaborative-filtering based systems we can push new articles to appropriate 

12 users, so long as experts provide initial scores for articles. In such an automated news 

1 3 system users that represent particular types of customers can be paid to provide ratings of 

14 incoming news articles. Based on profile information these ratings then imply ratings for the 

15 customers of a news cutting service, therefore allowing filtering of news to the most 

1 6 appropriate and narrowly defined subgroup of users of interests. 

17 It can further be combined within the framework of the present incentive scheme to 

1 8 encourage users who tend to individually find news, which is of particular relevance to their 

19 daily lives and unique interests to proactively collect information (including but not limited 

20 to digital recorded audio/video) and provide that information to news distribution systems. 

21 Collaborative filtering leveraging both the overall quality/interest of user content as well as 

22 of the nature of the new content which that particular user is providing can provide a nice 

23 method to filter and channel this information. 

24 The above bidding scheme enables a highly individualized incentive driven 

25 distributed news reporting system, which can significantly increase the availability of 

26 content for individuals. Users can continue to rate specific stories that they receive, so that 

27 the profiles of users are dynamically updated, and experts can be swapped in and out based 

28 on the assessed value and accuracy of new clipping. 

29 The present clustering techniques may also be used to identify users that are able to 

30 provide useful new content. This content may range from commentary, opinions, critiques 

3 1 and comparisons, but can typically be more robust than simple annotations, and may include 
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1 articles by knowledgeable experts providing pertinent or new information which is 

2 interesting or timely within the present context of the site. The predicted value to the 

3 particular cluster of users is a function of both the object profile of the original page as well 

4 as (perhaps even more relevantly) the object profile of the new content. 

5 We can also have experts bid for the right to participate in online discussions, where 

6 experts are selected that are polarized on a current controversial issue, in order to fiercely 

7 debate a particular issue. We could also select representative group members, from within 

8 different primary clusters, based on the domain knowledge, skills, life experiences, 

9 preferences social/political or religious view or opinions of users, to engage in an active 

10 dialogue, interview or debate with the high profile individuals. Other members of clusters 

11 can be invited to observe a discussion, and to provide feedback to maintain information 

1 2 about the quality of experts. 

13 13.2.3.1 Expert Sales Consultants 

14 A static application of the information in the system of SDI to on-line sales is to use 

15 the information to generate an ideal list of prospects for sales agents, and then help sales 

16 agents to target products and offers to individuals. We can provide information to 

17 salespeople about users, according to the profile of a salesperson (and reputation), and a 

1 8 user's personal terms for data-disclosure. Similarly, a system of SDI in conjunction with the 

1 9 methods taught in co-pending patent allow automatic detection of salespeople close to users 

20 (via an anonymous location market). The market allows matches to be made, but does not 

21 reveal anything about a user that the user does not authorize. User profiles can also be used 

22 to determine responses to offers and products (see the methods in patent application 

23 "System for the Automatic Determination of Customized Prices and Promotions" [U.S. 

24 Patent Application #10/262,123]. SDI allows profiles to be built from extended interactions 

25 across multiple vendors, so long as the user authorizes the same pseudonym for each 

26 vendor. 

27 As an example, consider an SDI-based system for contacting prospective customers 

28 and making personalized offers. The first step is to access the SDI pseudonymous user data 

29 base and identify the ideal audience of prospects for a product offering(s); then the second- 

30 step is to use the pseudonymous request protocols to request a sales call dialogue (or some 
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1 other form of dialogue) with the target prospect. The user's agent will then typically request 

2 further information (either before or after allowing access to even the pseudonymous profile 

3 database) such as what is the type of product (even possibly further associated details), and 

4 an associated approximate price range (which may be either negotiable or non-negotiable). 

5 The user may agree to allow a communication in the form of a targeted email that contains 

6 information content, or a link to a web page with additional information. This Web link 

7 and/or the e-mail can be personalized for the user, based on the user's profile. An email link 

8 on the web page could be used for direct Web-based IP telephony connection to the user 

9 (seller). Alternatively, a user's initial contact to web site could include encoded information 

10 about its profile, where the vendor can only receive the profile information if it can 

1 1 demonstrate certain profile properties. Large customers might then also receive specially 

12 hand-crafted information, if it is indicated that there is a large potential contract. The user is 

13 provided with a unique URL, or preferably the site is initialized so that when its pseudonym 

1 4 is provided the offer is customized appropriately. 

15 A more interesting example is a dynamic "experts market", which is an application 

1 6 of dynamic matchmaking between users in real-time based on their profile information is to 

1 7 a system for a virtual sales force. Users can also use seller profiles, to decide whether or not 

18 to interact with a seller. The profile of a sales-person may show correlations between 

19 product sell-rate and the type of product, type of user, that the sales-person interacts with. 

20 Initially seller profiles may not be very well related to sales-performance, but instead based 

21 on general SDI-style profiling, and wider (e.g. professional) credentials. Later, as a seller 

22 gains experience, profiling can be based on a sales-person's track record (and this will 

23 subsume other information). 

24 A consumer can indicate its preferences in its profile for "approach" by a sales 

25 agent, and then a match can be made with an appropriate sales agent dynamically in real- 

26 time via the above agent-agent matching and introduction process. The sales agent may be 

27 compensated when introductions lead to sales. The user could also receive preferential terms 

28 for a purchase, that can be realized online or in a physical store via coded print-coupons. 

29 This code with the time and identity of the (prospective) buyer as disclosed by the sales 

30 person. The buyer must authenticate that sales interaction (e.g., via disclosure of the code). 
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1 The criteria for matching the seller with a qualified prospective buyer is represented 

2 by client-provided meta-information associated with profile information, and can be used 

3 within a collaborative filtering system to determine the level of predicted interest which the 

4 buyer is likely to have for the items offered by a particular seller. The product space 

5 predicted to be useful to a buyer can be predicted based on his/her profile, browsing 

6 patterns, etc. Context can also play an important role, for example is the buyer currently 

7 engaged in browsing or buying related activities, can the buyer be persuaded to buy now 

8 with a good offer, or will the buyer never buy until he/she has performed more product 

9 search? 

10 Vendors may also compete for sellers based on the profile information and success- 

1 1 profiles of a seller, and a perceived match with the products offered by the vendor. A seller 

12 may also work for numerous categories of products/services and vendors. Finally, the sellers 

1 3 in the market could enter into a market-based system, so that buyers bid for the right to work 

1 4 with a particular sales person. 

1 5 Two other aspects of the present access to or by a prospective sales person include 

16 (1) In an off-line context, we can use a LEIA-based method to identify buyers and sellers 

17 with similar profiles in physical space, and dynamically reroute their paths to allow a 

1 8 meeting to discuss a possible trade. 

19 (2) Personal "Chemistry". In addition to a pure "product-space" set of profile features, the 

20 system might also consider wider compatibility between sellers and buyers in making 

2 1 meetings, for example choosing to introduce agents that share similar hobbies and spare- 

22 time activities. 

23 The problem might also be informational: e.g. find an expert on ancient American 

24 civilization for purposes of writing an article, or answering a specific question. Relevant 

25 information might include the expert's resume, and the expert's knowledge expertise profile 

26 developed from his/her activities in responding to previous queries. 

27 We might use a "fuzzy rule" to determine whether a user has a profile that is 

28 sufficiently close to an agent's goal profile to allow an agent-agent interaction and exchange 

29 of information. When the rule does not quite fire the system of SDI might also seek to 

30 clarify points of uncertainty, requesting further information until there is enough information 

3 1 to decide on the appropriateness of a contact. 
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1 It is clearly desirable to automate information exchange as much as possible, so that 

2 the first thing that a user knows about an agent-agent negotiation is after a deal has been 

3 struck that satisfies a user's preferences. 

4 One approach is to perform data analysis on a large data set of users who share very 

5 similar profiles and to perform very similar actions and behaviors in all aspects of their 

6 successful interactions with other agents. The data analysis might also be used to suggest to 

7 agents when there is a problem, and when it will be useful for a user to provide more 

8 accurate information about what types of actions it is looking to perform in certain 

9 scenarios, i.e. when the current level of confidence about the action desired by the user is 

1 0 low. In many cases the system might be able to identify with a high statistical confidence 

1 1 level an appropriate action based on the collective statistics of other similar users and 

1 2 conditions. 

13 When uncertainty exists the system can suggest the actions that see reasonable, and 

14 allow the user to choose one action. A natural language interface might communicate to the 

15 user the system's interpretation of the user's current behavior, the statistical correlations 

1 6 with recommended autonomous actions. 

1 7 For example, if a user is at home in his/her study between 9-1 1PM she/he is likely to 

1 8 be engaged in work relative to a research publication thus an invitation to other colleagues 

19 in this area will be extended and/or communication requests accepted to contact the user 

20 accordingly under these conditions. The system may detect other conditions in which the 

21 user may reject requests to correspond with these individuals, when the user's current 

22 context is inappropriate. Given this kind of feedback about actions, a user may now 

23 periodically review and adopt rules. For example, a user could also state that whenever a 

24 user has recently corresponded with a particular individual then the user is in a social mind- 

25 set, and would rather not engage in work-related activities. 

26 There are numerous potential inputs to the system which could be considered in 

27 predicting what agent-mediated actions are, in fact, appropriate. The techniques of the 

28 parent patent application [INSERT US PATENT NO] allow us to make strong inferences 

29 about the particular mind set of a user; i.e. reflecting present interests or preferences which 

30 the user is likely to be receptive to, presently based upon such clues as who the user is 

3 1 presently interacting with, the content profiles of the present real-time dialogues, e. g, typed 
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1 or spoken through a communications network, (or simply passively collected off-line), the 

2 object profile of documents or web pages being interacted with. Activities may further be 

3 inferred by such indicators such as the particular physical location of the user (i.e. within a 

4 LEIA-based system [INSERT US PATENT NO]): the particular sequence of movements, 

5 the particular types of devices with which the user is interacting, etc. 

6 More complex rules are certainly also conceivable, such, as the user will not interact 

7 with other users except under specific conditions, i.e. I will only talk to a user of type X 

8 (salesperson) if I have done nothing for the past 10 minutes and have nothing scheduled in 

9 my meeting planner. Such rules are initially hard-coded, but can then be learned and/or 

10 validated based on inductive learning: i.e. is it the case that I am more receptive to certain 

1 1 types of activities at particular times of the day, etc? 

12 In some cases SDI may identify individuals whose personal data revelation requires 

1 3 certain minimal requested personal data from the other party as a condition to an interaction, 

14 and SDI can again be used to automate this process is if it is well explained in a user's 

1 5 profile. When confidence is low the user may request additional information (anonymously, 

1 6 via SDI) and make a personal judgment in real time as to whether to continue an interaction 

1 7 and reveal more information, and whether or not an introduction is desirable. 

18 When information is valuable and sensitive, then it is important that initial 

19 information exchange is via SDI, without the end-users receiving explicit information. In 

20 overview, a user states the types of conditions that are necessary for a continued one-on-one 

21 interaction, and the system of SDI states whether or not conditions have been met without 

22 giving any specifics. Of course the identities of the agents are not released until 

23 introductions are made. SDI can track all activities, behaviors, and correspondences in 

24 which each user engages, and build a complete picture of the profile of a user with regards 

25 to interactions and the exchange of information. SDI may also use human experts to 

26 interpret requests and build rule-based systems to semi-automate decisions and automate 

27 easy cases. 

28 A simple but related application could be applied in the e-commerce domain to 

29 connect on-line customers with human sales representatives. A current version of this, 

30 offered by eStara, uses Internet-based telephony technology to enable browsers of on-line 

3 1 catalog pages to talk with knowledgeable sales representatives, simply by clicking on a 
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1 "talk" button displayed on the web page of interest. Integrated into this methodology is 

2 the ability of the sales representative to identify which particular web page the visitor is 

3 currently viewing, thus enabling him/her to more efficiently and intelligently assess the 

4 needs of the customer. 

5 This methodology could be further enhanced through the tracking of the user's 

6 pseudonym and associated profile data provided via SDI, which represents a far richer set 

7 of information than that used by eStara (which only makes use of the current web page 

8 identifier). A customer's full profile can include demographics, previous purchases, 

9 previous web-site visits, physical measurements (for clothing purchases), health history, 

10 and income. 

11 In a sense, individual sales representatives also have personal profiles; these can 

12 include experience, demographics, languages spoken, previous customer contacts, and 

13 product knowledge. An especially important factor is a representative's relative success 

14 in selling particular products - this history may reflect various personal strengths and 

1 5 weaknesses. 

16 When the customer clicks on an on-line catalog's "talk" button, his profile is 

17 immediately forwarded to the sales system, along with the particular product he is 

18 viewing. Matching algorithms (as described elsewhere in the patent) then choose the 

19 most relevant sales representative currently available for on-line communications. The 

20 representative is presented with a screen containing facts pertinent to the sale (including 

21 product details and extracts of the customer's profile), and he is then linked to the 

22 customer via Internet telephony. 
23 

24 Extensions to this technology include: 

25 

26 1 . Customized Price Discounts and Promotional Offer Recommendations 

27 By combining the predicted affinity of a user toward a particular item with price 

28 elasticity metrics (gleaned from his/her transaction logs and from real-time feedback 

29 provided to the system by the sales representative (e.g., the customer appears to be in a 

30 happy and spending mood)), it is possible to calculate a "personalized" price that 

3 1 optimizes the expected profit from a sale. This optimized sales price is communicated to 
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1 the sales representative, who is authorized to offer the discount to the customer. Digital 

2 coupon technology would be used to ensure that the offer is redeemed by that customer 

3 exclusively, and within a pre-set time limit. 
4 

5 2. Automated Sales Representative Support 

6 While interacting with the customer, the sales representative is presented with 

7 constantly-updated screen that provides information to support his sales effort. 
8 

9 a. Rapid Profiling 

10 If not much information is known about a customer, the representative 

11 would be guided through a script designed to elicit the maximum amount of 

12 information in the least number of questions. This information would be used to 

1 3 rapidly supplement the customer's profile. 
14 

15 b. Customized Sales Scripts 

16 The representative's scripted sales pitch would be dynamically adjusted to 

1 7 reflect the profile and current attitude of the customer. For example, if a customer 

18 is having trouble making a decision to buy (which could be detected by voice 

1 9 analysis tools and the total time elapsed in the transaction), especially persuasive 

20 text, geared to the customer's demographic, would be presented to the 

21 representative. 
22 

23 c. Additional Offers 

24 If separate, but related, products are predicted to be of special interest to 

25 the customer, the sales representative is alerted so that he can promote the 

26 additional offers while still in contact with the customer. 

27 3. Barter Exchange for Personal Favors 

28 In this variation, SDI can act as an exchange where professionals can exchange 

29 personal favors, for example one agent might indicate that it wishes to be introduced to 

30 another agent that can help with a particular professional problem. Both buyers and sellers 

31 can submit their respective needs, for example the need for a particular personal or 
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1 professional favor, and in the case of sellers the ability to provide favors. Other personal 

2 information, for example: professional, political contacts, organizational affiliations, areas of 

3 competence, professional responsibility and spheres of influence etc. can be confidentially 

4 entrusted with SDI and used to make particular types of matches. 

5 SDI can initially automatically define appropriate matches between the "buyers" and 

6 "sellers". Given sufficient data, humans may use content analysis techniques to define rules 

7 based upon certain key examples or commonly occurring request and matching solution 

8 pairs. The techniques of the parent patent application US Patent # 5754938 can also be used 

9 to refine rules, and create new rules, for the cases in which the level of confidence with 

1 0 introductions based with the current rule set is too low. The mechanism for establishing the 

1 1 value at which favors are sold might involve a bidding scheme, but in this case professionals 

12 may bid a "personal favor" in return for another favor. I.e. the transaction becomes 

13 disintermediated from the outside economy, with a pure non-money based method for 

14 exchanging favors. A user with the ability to provide a favor receives bids from other users 

1 5 in need of the favor she/he is able to provide. Of course, the value of a given favor is likely 

16 to be worth more to some individuals than others. 

17 The nature of the favor may range from very small and mundane, e.g., providing 

1 8 professional or personal advice or answers to a question, or it may be very significant and 

19 resource intensive from the standpoint of the provider, e.g., making a professional 

20 recommendation for a job position or political decision or providing 

21 recommendations/referrals for clients on a professional level or providing business 

22 recommendations to a partner or affiliate regarding a given individual or commercial entity 

23 with which to do business. 

24 The techniques of peer-ratings within a reputation system can be introduced, and 

25 extended to cover a wide array of qualitative descriptive attributes to reveal the 

26 qualifications and ability of an individual, to allow some individuals in the bidding process 

27 to be excluded in an initial "filtering process", because they fall below expected standards of 

28 service and performance. An auction based model may establish a price for a particular type 

29 of favor, and for a particular individual, and therefore users with high performance ratings 

30 could be expected to sell for higher prices than other agents. 
31 
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1 4. Barter Exchange For Favors 

2 The qualification threshold of the provider of the favor is more important in some 

3 applications than in others. For example, if the favor involves an interaction with a 

4 particular individual for organization (e.g. a political favor) in which the individual's 

5 anticipated ability and qualifications may be a significant factor in the likelihood to achieve 

6 the desired results. And thresholds may be set relating to quality or qualifications in their 

7 regard depending upon the relevance of this factor and/or the importance of the favor, which 

8 depends upon the type of favor requested. The bidding process is typically a reverse auction 

9 in which the request favor is passed around to multiple individuals (or entities) which SDI 

10 deems qualified for the particular favor requested (thus the lowest price which fits the 

1 1 qualifications criteria is typically the offer which is accepted. In other (perhaps many) of the 

1 2 barter exchange for favors, the qualifications of the requester may be an important factor for 

13 the requestee. These qualifications may include, not only professional abilities, 

14 proficiencies and credentials, but potentially much more subtle attributes relating to the 

15 individual's interpersonal, social or psychological profile and/or behavioral profile, for 

16 example, how SDI predicts that the individual receiving the favor will integrate and adapt 

17 within a particular professional context and/or interpersonal context which defines the 

1 8 situation and/or environment with which that individual must interact and/or perform, e.g., 

19 as in a business deal, a professional opportunity, an educational opportunity, social 

20 opportunity, etc. In order for this pre-qualification of the requester to unilaterally work so 

21 as to assure the requestee with the level of confidence and trust in the requester which 

22 approaches that of a traditional favor (in which the requester and requestee are typically 

23 extremely knowledgeable and trusting in one another) requires a very efficient and perhaps 

24 fairly comprehensive reputation system revealing much about the above described personal 

25 and professional aspects of one of both parties. As suggested, in such a system and just as 

26 important, is the trustworthiness of the individual providing his/her opinion about the 

27 credentialed profiles of the parties. Indeed a political or decision maker in a large 

28 organization would be extremely remiss (and perhaps even politically harmed) if an office 

29 or position were provided to another individual (requester) in which weaknesses in that 

30 individual's professional and/or personality later became evident to the detriment of the 

3 1 organization or political entity, as an example. 
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1 It is even possible, that in some cases, the need may exist for a pricing model in 

2 which the individual providing these credentials through the reputation systems is 

3 compensated for the task. If such an individual must be fairly knowledgeable and assure to 

4 properly judge professional or subtle personality components, this individual may develop 

5 an independent reputation for effectively and truthfully judging these certain characteristics 

6 on a category or domain basis. If the subset of "judges" is fully distributed and ubiquitous 

7 permeating a substantial portion of the population as a default, if the qualifications of the 

8 judge are unable to be validated in advance, it may be preferable to provide payment 

9 following a reasonable level competency and success on the part of the requestee and 

10 requester. Or, in another variation, users are required whenever appreciated (and in 

1 1 accordance with their own judging qualifications) to provide opinions on the individuals (or 

12 other entities), and they must comply up to a certain quota and with truthful and accurate 

13 opinions in order to maintain privileges to engage in the service (e.g., www.favor.com ) or 

14 other applications or services resulting from the reputation system. It is also, of course, 

1 5 critical to ensure the judge possesses with regards to the individual or entity with which 

16 he/she is affiliated. In this case, the assumption is that the barter exchange for favors could 

1 7 cover nearly any type of favor desired by a user and the reputation system is implemented 

1 8 for most individuals. Of course, the present system may also be extended within a business 

19 context in which the favors include those provided by in between businesses. In both the 

20 individual and business application context, it is necessary for the above described use of 

21 privacy policies to dictate what types of individuals or entities a user or organization to 

22 include, exclude or price discriminations against e.g. to various degrees). 

23 The following section "Agent-Mediated Value Exchange in the Supply Chain" talks 

24 about an economic system methodology in which a "value chain" is established for which 

25 entities may compensate other entities for present or often future value, which is likely to be 

26 sustained as a result of certain actions performed on behalf of that entity. This value often 

27 conveyed in the form of "barterables" could also be provided in the form of "favors" 

28 between the commercial entities (or potentially in a variation, individuals). 
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1 13.2.3.2 A Market for Referrals 

2 Situations frequently arise in a variety of contexts of human interaction (whether 

3 social or professional) in which a user may wish to refer the user they are in contact with 

4 to another individual. Often this occurs in a professional services context which a user 

5 has a particular need/or other characteristics which make him/her an appropriate match 

6 for the services provided by the other party. Or in a business context, often a user will 

7 forward a business contact or associate to another colleague who is deemed more 

8 appropriate for the particular context and/or scope of business. Likewise, in a personal or 

9 social context users may sometimes meet two or more individuals which they observe or 

10 perceive share common interests, goals or beliefs or perhaps possess complementary 

1 1 capabilities, knowledge, or characteristics. In each of the above scenarios, virtual tags 

12 may provide substantial benefits. For example, the referring user could forward the 

1 3 relevant portion of the profile and identified need of the user to the referring party whose 

14 user agent may determine the acceptability of the request and/or the priority with which a 

15 communication or meeting could be scheduled (e.g., as could be automatically arranged 

16 by/between the two party' calendaring agents). If the referring party's agent is unable to 

1 7 make a decision or priority assessment for scheduling purposes) on behalf of the user, the 

1 8 agent could instead try to contact the individual him/herself for assistance (and statistical 

1 9 feedback to the system's data model). In order for these types of referrals to be performed 

20 efficiently, the area of expertise required can be specified, and provisions can be made 

21 about the type of referrals that a professional will accept. 

22 We can establish a market model for referring individuals within SDI, to 

23 compensate users for successful referrals. With SDI we can create a market that allows a 

24 wide set of criteria, including social/interpersonal skills, moral/ethical values, etc. to be 

25 used to find good prospects for referral. SDI might also provide information to allow a 

26 system to predict the tasks or professional occupations most suited for a user. The system 

27 can look for correlations between profile data and professional successes, based on the 

28 historical track record of successes and failures. SDI might compute a list of human 

29 descriptive qualities that will secure payment for referrals, i.e. relevant attributes which 

30 SDI infers may be associated with useful types of social or professional individuals. 
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1 The scheme might also be used to identify individuals with raw human potential 

2 that can perform well with the right type of training. We might use such a scheme to 

3 discover hidden talent within under-developed or impoverished regions of a country or 

4 the world. 

5 1 3.2.3.3 Groupware: Collaborative Project Development 

6 The present system framework is designed to provide a secure infrastructure to 

7 allow users who are most appropriate to provide a particular type of contribution, based 

8 on a precise contribution needed for a project. The project may be a collective project, 

9 developed by users who have all been introduced to the project within the same context. 

10 There is also a role for an evaluative component to compute relevance feedback in 

1 1 qualifying and quantifying the nature and quality of the contribution. 

12 We can also use a query-based system to establish a user's relevance to a particular 

1 3 task, or another user, along the lines of the method in patent "System for Generation of 

14 User profiles for a System for Customized Electronic Identification of Desirable Objects" 

15 US Patent #5754939 

1 6 Consider the problem of generating a team of individuals to author a document. 

1 7 This scheme can be deployed across intranets, extranets or the Internet. The degree of 

18 confidentiality provided and produced may determine whether certain credentials are 

19 required, e.g., the individuals belong to certain organizations and are under strict terms of 

20 confidentiality, etc., or whether the document's level of confidentiality is low enough to 

21 enable free access by users on the Internet who may want to, and are able to, usefully 

22 contribute. The idea behind the present scheme is to leverage techniques of natural 

23 language processing and/or information retrieval in order to develop profiles of 

24 individuals based upon the value of their past contributions to documents; for example 

25 informational content, style of authoring, etc. In a Web-based application the content can 

26 be graphical designed content, perhaps even video segments. Within SDI it is possible, 

27 based on feedback from previous articles, to anticipate how valuable the information is 

28 likely to be for a particular readership. 

29 The problem might also be informational: e.g. find an expert on ancient American 

30 civilization for purposes of writing an article, or answering a specific question. Relevant 
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1 information might include the expert's resume, and the expert's knowledge expertise profile 

2 developed from his/her activities in responding to previous queries. Level of expertise 

3 might also include the size of projects performed within a particular specialized area, and 

4 relevant education qualifications. 

5 1 3.2.3.4 A System for Smart Consumer Research 

6 There is a significant need within the field of consumer product creation and 

7 development to be able to more instantaneously, and on a larger scale, collect direct 

8 feedback from a large number of consumers in lieu of much more sparsely populated focus 

9 groups and "product development" experts. There is also a need to better characterize 

10 consumers whose feedback is utilized in any kind of market test analysis. Collaborative 

1 1 filtering can provide a very efficient solution to both of these problems by enabling focus of 

12 feedback from selectively chosen consumers whose attitudes and opinions (and even 

13 marketing ideas) are the most meaningful and representative of the large segments, 

1 4 comprising the majority of the consumer population for that product. We can further extend 

1 5 this methodology to determine and measure the "value" of certain individuals in the process 

1 6 of product assessment, i.e. to identify the "experts" in a particular area. 

1 7 The information in SDI can be used to identify appropriate clusters of users for a 

18 particular product, so that a prototype or actual product can be presented to users, and 

19 feedback collected. The goal is to collect detailed feedback: across a variety of quality and 

20 consumer satisfaction criteria, more subjective opinions, areas of criticism and ideas for 

21 improvement or modification. The information in SDI on user profiles can be used to 

22 develop appropriate "test sets" of individuals for this purpose. In a cluster-based profile 

23 system, we might suggest that feedback is requested from agents that are close to the 

24 centroids of a cluster, i.e. the users that best represent a set of users. We can even selectively 

25 cluster users based on metrics to a new product. 

26 Analysis can be used to determine how to market and price products and services to 

27 different clusters of users. The resulting feedback can be useful in determining in what ways 

28 to better adapt the product or modify the concept to better match the preferences and needs 

29 of the nearby larger cluster (i.e., larger market population of consumers) or (in another 
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1 scenario) shift the acceptance rate from one consumer cluster with a high affinity to another 

2 with a lower affinity, however, with a larger market potential. 

3 We can use the information in the multi-user SDI database to predict the ability of a 

4 consumer to generate useful ideas for product concepts, improvements or extensions. These 

5 individuals typically (but not necessarily, exclusively) will belong to the cluster of the 

6 product. These users could also be used to create and assess various marketing/promotional 

7 ideas. 

8 Furthermore, we might build a system to provide incentives for individuals to 

9 identify commercial opportunities for a commercial entity. Again, profile information for 

1 0 the customer base of consumers might reveal possible synergies, and SDI might be able 

1 1 to derive descriptive summaries as to the type and nature of the opportunities which are 

12 best suited for a company, to allow easy future identification. 

13 13.2.4 Information Exchange via Traditional Print Media 

14 In this section we note that a personalized offer from a seller to a buyer can also 

15 be executed via traditional print media. We explain how a physical coupon can be 

16 printed, for example at the point-of-sale, or in some shop-based kiosk, or at an ATM. The 

17 coupon can digitally (e.g. Via a bar code) specify terms of a transaction, etc. The co- 

18 pending patent application entitled "System for the Automatic Determination of 

19 Customized Prices and Promotions" further describes the preferred implementation for 

20 implementing customized offers via coupons, through the use of cryptographically secure 

21 digital coupons which can authenticate the user and be delivered to the user in a non- 
22 transferable and non reusable manner. 

23 Within the system of SDI we can also use secure user features in the SDI database 

24 to offer special offers to an individual. Of course, the vendor that requests coupon 

25 generation based on information about profiles can execute the request without knowing 

26 the profile or identity of the user. 

27 As explained in the top-level SDI description, within the query-execution module 

28 rules can be specified that determine whether a profile is suitable for an offer, and the 

29 type of offer to make; and the system of SDI can automatically furnish the provider of the 

30 profile information with a coupon (as long as that is compatible with that user's requests). 
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1 The offer to a user can therefore be flexible, based upon a user's profile. The user may be 

2 given an encrypted code to present to a cashier when it makes a purchase, allowing offers 

3 to be made by vendors that are not on-line, to encourage a user to drive to their physical 

4 stores. More information can be available within SDI, for example to allow a user to 

5 receive information about quality ratings of vendors, for the products and services that 

6 have been provided to other users. In return, the vendor may request certain purchase 

7 pattern data from the user. In another variation, the user might enter certain identifying 

8 information such as the first several words of the offer, and then receive an email or fax 

9 of a barcode to take to the shop for scanning. 

10 This method of discounting can be extended to users with Personal Digital 

1 1 Assistants (e.g. palm computing devices) in a store, that can use a scanner to enter bar 

12 codes of products, and then enter into an agent-mediated negotiation to secure a good 

13 price for the product. The user may be requested to disclose certain personal data, if a 

14 "better" offer is to be provided. Again, even in the in-store application the vendor does 

15 not need to receive actual information associated with the user, the processing can be 

16 performed remotely either at the user's ISP proxy server or centrally on a SDI server, 

17 with the vendor providing methods to adjust prices and offers based on a user's profile, 

1 8 but without seeing the actual profile. 

1 9 SDI may also implement time-of-purchase competition, soliciting and revealing to 

20 the user competitive offers from vendors in stores in a close physical proximity to the 

21 user's location. The user could also be prevented with other types of useful information, 

22 including: information for competitive products including (nearby) locations, price and 

23 functionality for eliciting time of purchase competition as well as previous customer 

24 complaints about each vendor. 

25 Finally, an alternative to dynamic negotiation with the vendor is that price labels 

26 in stores could encode a strategy that the store proposes to use to price-negotiation with a 

27 user, and just download this strategy to SDI and combine it with the user's profile to 

28 compute the final offer. The user receives a validated offer from the vendor, that certifies 

29 that the user with his/her current pseudonym is able to receive the price or discount. The 

30 offer can be encrypted to prevent fraud by the user. 
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1 As suggested above, SDI acting on behalf of the user may selectively release only 

2 that information which based upon the presumed price/offer generation model used by 

3 the vendor will elicit a price decrease delivered to that user. Within this price negotiation 

4 process we can require that SDI releases just the right of information to optimize a user's 

5 price. This can be part of the understanding with a vendor (even though a vendor may 

6 commit to a strategy up-front). SDI cannot falsify information on behalf of a user, but is 

7 free to withhold information. 

8 It is certainly conceivable that such a technique could be deployed by vendors, to 

9 allow customized pricing for users as they shop in a store via information encoded with 

10 product identities, and negotiated via profiles stored in decentralized SDI nodes. A useful 

1 1 default to make the system work for non-SDI based users could be to allow a user to user 

12 averaged ratings on various criteria, annotations, etc. 

13 Extensions of the present scheme could include other delivery media such as 

14 radio, TV, billboard, etc. For example, a web site could be provided to access 

15 information about a digital and personalized coupon, or access could be provided via a 

16 web-enabled cell phone. Printed classifieds could allow a prospective buyer to access 

17 satisfaction ratings via a printed bar-code type device, to allow the user to enter the SDI 

18 database via its personal SDI device. A hybrid voice/telephone approach is also 

19 interesting, where a user automatically discloses certain types of information to the 

20 salesperson, but is left to explain other features that are not automatically provided. The 

21 LCD/browser of the phone might provide some of this information to the other user. This 

22 is an interesting hybrid; an SDI-enabled but human-mediated information exchange 

23 system. Profile information can be required of the party placing the call, so that a user can 

24 ascertain what kinds of information should be revealed to the caller, based on its profile. 

25 Social or business related communications via physical mail could apply these 

26 same techniques, with a typical encounter made off-line, but additional information 

27 provided by SDI in response (electronically), and supplemented as necessary with other 

28 information provided by the recipient of the mailing. As discussed elsewhere in the 

29 patent, the initial contact can be pseudonymous, with agents only identified with mutual 

30 consent. The mailings can be sent directly by SDI as the result of executing a query. 
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1 13.2.5 Resale Markets 

2 It could be useful to extend the system of time-of-purchase to allow individual 

3 sellers, in addition to vendors, to participate in requests for products from other agents in the 

4 system of SDL For example, we could allow an individual seller to inform the system of 

5 SDI that it has product X for sale, and that it is happy to submit ask price $Y whenever a 

6 customer within its local geographical area is about to make a purchase of a similar item. 

7 For a one-time price the system of SDI could quite simply incorporate these individual 

8 sellers on a continuous basis within a double market for goods and services. 

9 The system of SDI could even provide a more advanced service for sellers. An agent 

10 might inform the system that it has product X, and that it wishes to maximize its re-sale vs. 

1 1 usage value. The agent can indicate this time-cost tradeoff to the system of SDI, and then 

12 leave the system to track the average resale price, and determine an optimal time to provide 

1 3 the product for re-sale. 

14 A variation of the resale market could include a "barter exchange" in which the 

15 seller receives another item, typically of comparable value, from another seller. I.e. agent 1 

16 can state "I will barter A for B", agent 2 can state "I will barter B for C", etc. and the 

1 7 marketplace of SDI can look for a closed-loop of barters, such that every agent is happy. 

18 The user that buys a good can be an "advertiser" in a resale market, and other users 

19 subscribe and specify controls over the personal information made available to 

20 sellers/buyers. We can use a variation on credentials to act as "digital deeds", such that the 

21 system of SDI validates when a product was purchased by the user, and whether it was 

22 new/used. This takes much of the uncertainty out of re-sale markets because then buyers are 

23 sure that the user is not falsifying information and items in the market. Digital deeds can act 

24 as credentials for ownership of a given product. 

25 In fact, an agent can even be automatically asked by the system of SDI on 

26 completing any purchase whether it also wants to have the item listed and available for 

27 resale. If no, she/he is asked itfwhen at a later times she/he may change his/her mind. The 

28 system can request information from the agent about the price range that she/he would be 

29 interested to sell the item at, and the agent could be presented with a typical price range for 

30 the item. Of course, in a dynamic auction-based system this becomes the "reservation 

31 price", i.e. the lowest price that an agent is willing to accept. Just as buyers can have more 
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1 information about sellers, the sellers in the marketplace can have more information (even 

2 though it remains private) about buyers. For example, a seller can specify rules such as: 

3 "I will only enter into a contract to sell to an agent that has never defaulted on 

4 payment", etc. 

5 Similarly, users (e.g. owners of kittens) may have certain personal interests for items 

6 of personal or sentimental value, and wish that they are sold to certain types of users. An 

7 interesting extension allows vendors to impose restrictions on the re-sale of items. The 

8 system of SDI can implement these systems for a vendor, so that at least within SDI 

9 products of certain types (e.g. software products) are not sold second-hand to other users for 

10 cheap duplication. Information goods can also be protected at source — for example with 

1 1 digital watermarks to prevent unidentifiable duplication. 

1 2 Another variation is an auction scenario, where a seller brings an item to market, and 

1 3 buyers compete for the item. A professional, that makes a living by buying items and selling 

14 them on, might like an exclusive SDI-based auction site. Within the system of SDI we can 

1 5 support a network of auction-sites, that push agents around between sites and revenue-share, 

16 with source web-sites providing shares of revenue achieved in subsequent purchases on 

1 7 other sites. In another variation, the operator of a small site may be linked from a larger site, 

18 and provide a share of revenues to that larger site. SDI has a couple of important roles: 

19 monitoring purchases and ensuring that contracts are fulfilled, and also estimating the value 

20 of links if an up-front price is to be negotiated, based on information about the profiles of 

2 1 streams of customers to a particular site. 

22 The information in SDI can be used to allow disintermediation of consumer-to- 

23 consumer markets, with agents able to reach ideal prospects based on personal information 

24 stored in the shared database, make an anonymous offer, and the consummate the trade of 

25 possible. This is likely the way of the future in this consumer-to-consumer market. Just as a 

26 vendor in current e-commerce systems can store information about its own user-base, and 

27 use that information to send personalized offers to users, within the system of shard 

28 information in SDI all agents can share information with all other agents, and the playing 

29 field is leveled. Users can query the pseudonymous user profile data and (if permitted), gain 

30 direct personal access to appropriate target customer prospect. 
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1 Conversely, prospective customers who are interested in a particular item(s) may 

2 query and access the database of users who are knowledgeable regarding that particular 

3 product/service (which may possibly be under a pseudonym), and also accessible to the user, 

4 i.e. available at an appropriate time and in an appropriate location. 

5 13.2.6 Transportation Example 

6 Another application of user profile database, in the case that the database contains 

7 also real-time information, is in an application to transport scheduling problems. The traffic 

8 systems in many large systems are congested. Via the system of SDI it would be possible to 

9 allow agents to represent the wishes of their uses within an SDI-enabled mechanism that 

1 0 controlled access to certain lanes on highways. The goal might be to control the flow of 

1 1 traffic, such that users agree to pool vehicles based on similarities across planned trips, and 

12 users also payments in return for the right to travel at certain times and in certain lanes. The 

1 3 goal is a more efficient system that maximizes the sum value to all agents. 

14 his is an example of a more general application area, where agents can look for 

1 5 opportunities to change their actions and behavior in return for higher system efficiency, 

1 6 and/or payments. We describe below an application to user agents negotiating on behalf of 

17 individuals ('travelers") to optimize the efficiency of a shared transportation system. The 

1 8 fundamental idea is to embed multi-user awareness into the basic fabric of a transportation 

19 system, in which the system aims to maximize the efficiency of the system by allowing 

20 payments between individuals. 

21 To accomplish this task, the system must be able to anticipate both the immediate 

22 direct effects, a particular accommodative action will have on another agent(s) (mobile 

23 entities) as well as the indirect (cumulative) effect on the other agents and the associated 

24 time delay impact on each agent and on the entire transportation system as a whole. 

25 The techniques of the LEIA can be efficiently employed in tracking the present and 

26 anticipated activities, location and movement patterns of individuals who are in the process 

27 of traveling to a destination via simple or multiple modes of transportation, e.g., any 

28 combination of the following, i.e., pedestrians, automobiles, taxi, train, and bus (including 

29 public transportation). LEIA can be applied in both scheduling the flow of traffic for 

30 transportation media which are not subject to fixed time schedules (cabs, autos, pedestrians) 
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1 as well as to dynamically improve overall efficiency of the movement patterns in order to 

2 assure that users on a collective basis arrive at their desired destinations with minimal delay. 

3 The system of SDI can implement a real-time market where agents make and receive 

4 payments in return for changed actions from other users. For example, agents with high 

5 priority may be prepared to pay more than other agents for the right to enter a highway at 

6 any particular moment in time. 

7 The market pressures at any point in time are between the marginal cost to an 

8 individual for changing his/her plan, and the cumulative marginal benefit to other agents in 

9 the system. Within incentive-compatible mechanisms, such as the regular price-based 

10 market place (e.g. the Walrasian tatonnement model) with enough agents, then it is optimal 

11 for an agent to reveal its true value for different outcomes to the marketplace, and the system 

1 2 of SDI in "clearing" the market can maximize system efficiency. 

13 The central contribution of the SDI system is that it acts as a trusted auctioneer, 

14 receiving information from agents, and implementing an outcome based on that information 

15 without releasing that information to other agents. All that agents see are requested actions, 

1 6 and payments received or payments to make. 

17 We could model a core sample of individuals on an active basis to determine 

1 8 implied valuation functions for different types of users, based on feedback provided by those 

19 users about how happy they are at any point in time. Within an incentive compatible system 

20 it is optimal for an agent to have a truthful and complete representation of its valuation 

21 function. The problem is to discover that valuation function. Any help from SDI, via data 

22 mining techniques (i.e. similarities between my profile and the profile of another user that 

23 has rated his/her happiness) can be beneficial in reducing the costs of participation in such a 

24 scheme. Then, before taking a journey, the user could provide some explicit information 

25 regarding the nature of the travel, for example the level of importance in minimizing travel 

26 time, whether the trip is business or sightseeing, the time of day, etc. With this information 

27 the system can assess the user's valuation function, and then have the user participate within 

28 the market, and secure a plan based on actions from all agents. The system might compute 

29 price thresholds, that represent different things the user can achieve for different prices. The 

30 system based upon its determination of situations/context can presents the price and time 

3 1 and urgency inferred for a user prior; which can be reviewed and corrected by the user. 
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1 With this profile information, the auctioneer can now compute prices to clear the 

2 market and maximize economic efficiency in the system; taking the following approach: 

3 1 . Armed with statistical information about the users of the transportation system e.g. 

4 the highway for the next hour, compute static (fixed) prices for different options. 

5 The goal is to maximize the efficiency in the market through a simple pricing 

6 mechanism, that is computed based on information provided to the auctioneer in 

7 advance. 

8 2. Announce the prices, and allow users to use the system as they desire, with 

9 appropriate payments made. Payments can be readily extracted from agents via 

1 0 automatic toll systems. 

1 1 Of course over time the performance of such a system could be optimized, as the 

1 2 auctioneer (the SDI marketplace) learns about consumption patterns of the agents over time. 

13 One way to provide incentives for agents to provide information to the system to allow up- 



14 front computation of useful prices to support an efficient outcome would be provide 

15 discounts to the agents that provide information. When such a system works well then 

16 roadway traffic can be controlled, congestion controlled, via an automated price-based 

17 system. Additional information made available from within SDI might allow adjustments to 

1 8 prices, based on unexpected flows of traffic. Just because the prices are optimized once up- 

1 9 front, based on projections, and announced to agents using the system; it can still be possible 

20 to adjust prices during any particular period — so long as there is a method for that feedback 



21 to alter the actions of agents, i.e. there must always be an alternative to make such a system 

22 work. Consider a two-road system, one shorter and one longer, then the longer route would 

23 be priced less than the shorter route, and the price differential adjusted based on current 

24 levels of congestion. 

25 Again, to clarify, SDI has a key role in this system. Agents are only prepared to 

26 reveal valuation and trip data within a system that carefully protects their privacy, and 

27 controls that information. 

28 Via the optimization performed to compute appropriate prices the model takes into 

29 account the effect on congestion that consumption decisions have, i.e. there is a relationship 
3,0 between the value to an agent that selects option A and the number of agents that select 
3 1 option A. The auctioneer needs a model of the transportation system itself to perform this 
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1 type of computation. Essentially, we compute the fixed point of a non-linear system of 

2 equations. Stochastic optimization techniques are suitable for such a calculation. 

3 As the system collects more data it can be less important to require explicit 

4 information from agents within the system, unless it is believed that there is a special 

5 situation about to occur (for example a Flyers game), which will have unusual effects on 

6 traffic patterns. 

7 Co-pending patent application entitled LEIA-TR describes flexible tools and 

8 automatic traffic ticketing which could be integrated in to the same transaction 

9 infrastructure. SDI would provide a methodology to elicit information from agents, and 

10 dynamically set the price of traffic tickets and parking meter tolls to maximize the 

1 1 performance of a City's parking resources. For example, one goal of the transportation 

1 2 authority can be to optimize the amount of revenues it can achieve, by charging more during 

1 3 "high-demand" travel periods. 

14 Extensions of the present scheme could include coalition discounts, where coalitions 

1 5 of users can negotiate discounts based on group purchasing power (let us all use your system 

16 for a 1 0% discount or we will use an alternative road system). The coalitions may be formed 

17 automatically within SDI using shared profile information, allowing agents with non- 
18 competing interests to "pool their buying power". Finally, it is interesting to note that users 

19 who share identical travel objectives can be encouraged to share vehicles, and/or public 

20 transport can be provided as alternative means of transportation on a dynamic basis. 

21 13.3. Client-Side Data Mining Applications 

22 Finally, we describe applications of client-side data mining, where the private 

23 information about an agent remains on its local client machine. Personalized of services and 

24 products is performed by pushing methods to the client machine, where the methods 

25 compute based on local profile information the most appropriate information for the user. 

26 This is very useful because the user retains absolute control over his/her profile information, 

27 but can still receive the benefits of personalization. This is an extension of iamworthit: users 

28 still provide profile information to the central database, to allow central queries and to 

29 receive value for that information, but never release identified information to a vendor. 
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1 13.3.1 Client-Side Personalization 

2 Client-side personalization allows vendors to push personalization rules to client 

3 machines that are special queries which use information store in a local client profile 

4 database to provide a user with personalized information and/or services. The 

5 personalization rules use locally stored profile information to compute an optimal product or 

6 service, or to allow a vendor to configure a virtual shop front. The answer to a query is 

7 returned to a vendor, to allow the vendor to push suitable commands to the client to enable 

8 appropriate displays to be created on the device with which the user interacts with the client. 

9 Client side personalization is useful within the system of SDI because: 

10 a) It allows vendors to leverage profile information across multiple profiles that 

1 1 belong to an individual without explicitly receiving the information. 

12 b) It provides users with the convenience of using their true identities in payment 

13 and specification of delivery addresses for goods, because vendors do not need to 

14 receive any information about the profile of an individual except that which was 

1 5 learned during the transaction. 
16 

1 7 Although we describe methods to allow anonymous and/or pseudonymous payments 

18 and physical mailing of packages, through the use of third-parties or cryptographic 

1 9 techniques, it is useful to allow users to provide true identities because this fits better with 

20 traditional methods of business. 

2 1 Personalization rules select the most appropriate information and products to display 

22 to a user. The client machine computes a selection, and responds to a vendor with its 

23 selection, so that the vendor can push personalized content and make customized offers. The 

24 traditional method of personalization has a user push profile information to a vendor, and 

25 the vendor push personalized information to the user. The provider can learn information 

26 about the preferences of the consumer, perhaps more information than the consumer trusts 

27 the provider with, given that the provider might sell the information to other agents, or use 

28 the information to determine the identity of the consumer. 

29 In safe personalization the vendor pushes its evaluation method and generic 

30 information to the consumer. This provides the same level of personalization, but without 

31 releasing any explicit preference information about the user's preferences to the provider. 
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1 The vendor can still perform collaborative filtering in the central SDI data warehouse, for 

2 example with anonymous profile information or without receiving the information explicitly 

3 but submitting a collaborative filtering program directly as a query. With safe 

4 personalization the user can (a) receive financial rewards for his/her profile information; (b) 

5 provide profile information to allow collaborative filtering but retain ownership over that 

6 information; (c) receive targeted products and services. The method is illustrated in Figure 

7 19. 

8 The only information received by the vendor is implicit preference information 

9 based on the products or services that the consumer chooses as most appropriate. As a 

10 default, a user might choose to use a persistent pseudonym with a vendor, so that the vendor 

1 1 can collect profile information about the user across a number of interactions. The system of 

12 SDI also allows vendors to customize information on the basis of a user's transactions with 

13 other vendors, browsing activities across the web and also wider demographic information, 

14 all of which can be maintained at a user's client for safe personalization, and pushed to the 

1 5 central SDI data warehouse for collaborative filtering type applications. 

1 6 One draw back for a vendor is that it provides the agent computer with methods to 

17 target information. For example, Amazon.com must provide the agent computer of a 

18 consumer with a method to allow it to personalize future transactions. The method is 

19 valuable information to Amazon. However, to protect this method from other possibly 

20 competing vendors, e.g. www.barnesnoble.com, Amazon can use encrypted codes to 

21 represent the results of a profiling query on the user-side profile database, and these 

22 encrypted codes can even change periodically, or from user to user. 

23 In the basic variation of personalization the vendor pushes a personalization query to 

24 the agent, to be executed on the agent's computer in the client-side query-execution module 

25 on the basis of the agent's single-user data warehouse. This is illustrated in Figure 1 8 above. 

26 The vendor then receives the result of the personalization query, which might state for 

27 example: "this user is very like my concept of what user Mary might want to buy", where 

28 Mary is an aggregate user that represents data mining the vendor has performed. 

29 Alternatively, the result might say something more direct like "show the user product X", or 

30 make special offer "Y'\ Whatever information the vendor receives in response to the query, 

31 it can then push commands to a user's agent computer to display graphics that represent its 
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1 desired response. For example, in an Internet application with a user that is browsing via an 

2 Internet browser, the vendor could push HTTP commands for a personalized shop front for 

3 that user based on the results of its personalization query. 

4 We can describe a number of variations on this basic concept, that describe different 

5 method to compute the best action to take given a vendor's personalization method and a 

6 user's profile information: 
7 

8 1 . Every-time Client-side personalization. In this variation the vendor pushes a 

9 personalization method every time the user initiates a new interaction. The agent 

1 0 computer executes the personalization query, and makes its response to the 

1 1 vendor. The vendor can then send personalized content to the user. 

12 2. Periodic Client-side personalization. In this variation the vendor maintains (either 

13 vendor-side or user-side) the data that the last personalization was performed and 

1 4 pushes a new personalization query to the user periodically, when it thinks that 

1 5 the user's profile will have changed or its personalization methods have changed. 

1 6 At all other times, the user maintains as part of its profile the result of the most 

1 7 recent personalization, to send to the vendor and allow the vendor to provide 

1 8 personalized service. Alternatively, if the user interacts with the vendor under the 

1 9 same pseudonym in the future then the vendor can store the result of 

20 personalization for that vendor. 

21 3. One-time Client-side personalization. In this variation the agent computer only 

22 ever performs a single user-personalization step, which is then maintained by the 

23 user as part of its profile for that vendor and passed in the future to allow 

24 personalization. 

25 It can be computationally complex to compute the result of a personalization query. 

26 For example, profiles about the objects on a vendors web page or in its inventory can be as 

27 large as the object descriptions themselves, and full decision tree representations to decide 

28 how to target a consumer can be very large and complex. In cases where this is a problem, 

29 and it is inefficient or infeasible for the vendor to push a complete decision tree to the user 

30 we propose the following solution: 
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1 4. Iterative Client-side Personalization. In iterative client-side personalization the 

2 vendor and the user, via their agent computers, participate in an iterative 

3 distributed protocol to compute a personalization result for a user based on its 

4 profile. A similar method was earlier disclosed in US Patent #5753938 The idea 

5 is to structure the decision tree, and for example pass initially the first few levels 

6 to the agent. The agent computes the result of those levels, responds to the 

7 vendor, and the vendor passes the next few levels that are relevant to the result 

8 received. In this way, only the parts of the decision tree which are actually used to 

9 compute the result of a personalization query are exchanged with the consumer's 
10 agent computer. 

11 

12 Specifically for collaborative filtering applications, a simple technique can be used 

1 3 to reduce the amount of information that must be provided to a user to compute the result of 

14 a personalization query. In collaborative filtering a vendor's decision about which objects 

1 5 are appropriate to show to a user is made on the basis of a similarity comparison between 

16 the user's profile and the profile of other users for which the vendor has information about 

1 7 how they responded to certain objects. The straightforward method is to pass the profiles for 

18 an entire user population to the agent's client machine. This is undesirable when there are 

1 9 many users in the population. 

20 A better solution is to pass only cluster centers to the client, where the cluster centers 

21 are chosen to allow collaborative filtering to a sufficient degree of accuracy. For example, 

22 we illustrate below an efficient method for collaborative filtering at the client, and our 

23 proposed efficient implementation. This is illustrated in Figure 20. 

24 The idea is to select a subset of users that represent the entire space quite well, which 

25 the vendor has collected by performing data mining queries on the central SDI data 

26 warehouse, and then submit just those data points to the client. In the Figure we plot the 

27 complete set of user profiles in profile space, with each user profile associated with 

28 information about how to target a product or service to a user with that profile. A typical 

29 method to perform collaborative filtering with data of this kind is to find profiles in the 

30 neighborhood of a new data point, illustrated with * in the above diagram, and then base the 

3 1 decision on what was successful or unsuccessful for those users. Clustering replaces a set of 
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1 similar users with one "cluster point" that represents what information a typical user in that 

2 region should be shown. This can be computed using standard cluster analysis techniques. 

3 The reduced space of data points, shown on the right, is sent to the client, and the client 

4 performs collaborative filtering over the cluster points to computer an approximate solution 

5 to the full personalization query. Completing the example, the agent computer can 

6 determine which "aggregate user" its profile is closest to, for example Mary or Yu-wen, and 

7 send this to the vendor. 



8 Notice that the vendor does not even have to provide the "y-values" or the 

9 personalization methods which are associated with each user profile, it is sufficient for the 

10 agent to compute the closest cluster center and pass that information back to the vendor. 

1 1 This is useful to vendors because the information is not useful to anyone but themselves, 

1 2 because no-one else can interpret what it means to be like aggregate "Mary". 

1 3 An alternative is to provide generic information to the client each time, and allow the 

14 client to display appropriate information automatically, without requesting information from 

1 5 the vendor. This removes the final handshake between client and vendor, and might be 

1 6 important if the user does not wish the vendor to receive any information about the result of 

1 7 personalization, not even how it decides to use the data provided by the vendor. The vendor 

1 8 only gets information in the case of a purchase. 

19 A simple extension, permitted with the architecture as described, because the agent 



20 computer implements the central SDI data warehouse query-execution and pricing modules 

21 but with only profile information about the single user, is to allow the user to demand 

22 payment in return for allowing the vendor to make a personalized sales pitch. 



23 13.3.1.1 Implementation Techniques 

24 The implementation of our system for privacy-protected personalization requires the 
2 5 following functionality: 

26 (1) A vendor must be able to push generic information to a user's client machine 

27 (2) A vendor must be able to push methods to a user's client machine that allow that 

28 client to process the generic information locally, using a user's profile information 

29 as an input into a decision theoretic rule (that can be rule-based, collaborative 

30 filtering based, etc.) 
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1 (3) A client machine must be able to process the generic information locally, together 

2 with a user's profile, and decide what information to display to the user. 
3 

4 A reasonable implementation of safe personalization within the system of Secure 

5 Data Interchange can leverage current World Wide Web standardization efforts that are 

6 based around the Extensible Meta Language (XML) [B97; CKR 97; KR97a; KR97b], and 

7 promoted by the World Wide Web consortium [W3C-XMLabc]. XML allows data to be 

8 structured and accompanied with a Data Description File that defines the semantics of the 

9 data, making it suitable for client-side processing by the receiving agent computer. The 

10 method to process the information can be for example implemented with the Java 

1 1 programming language, which will execute on a variety of different platforms via a Java 

12 Virtual machine. A vendor can send formatted data via XML and a Java program to target 

1 3 data to the user based on access to the user's profile information. 

14 It is important that the SDI client-side proxy agent carefully controls the access that 

15 the Java method gets to an agent's profile information, but this can be structured in just the 

16 same way as with query-execution in the central SDI data warehouse. The method must 

1 7 maintain a separation between profile information and the execution state of the program, 

18 and check that all data access commands are consistent with a user's price rules (and data 

19 access rules). 

20 Furthermore, it is important that the SDI client-side proxy agent executes the 

21 vendor's personalization query does not communicate profile information to other agents, 

22 and methods can be used to control the access of the query to communication ports, memory 

23 and disk space. 

24 Clearly, standardization of user's profile information is necessary, so that all vendors 



25 can write methods that can access a user's profile. One solution that would support 

26 implementations from multiple providers of profiling functionality (i.e. would not require 

27 clients running standard SDI-profiling software) would be to provider "XML-data 

28 wrappers", that convert non-standard formats into a standard XML data format, that is used 

29 by all personalization methods. One language for representing user profiles has been 

30 proposed by the W3C consortium, known as PIDL (personalization definition language) 

31 [W3C-PIDL]. 



Page 339 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 1 3.3.2 Real-Time Targeted Advertising 

2 A useful application of SDI allows vendors to provide targeted advertisements to 

3 users based on their browsing behavior and other profile information, but without actually 

4 receiving explicit information about a user's profile. We propose an advertising-server, 

5 which controls the network of adverts on behalf of vendors and users. A user's client 

6 machine can receive a choice of adverts whenever it hits a web site in the iamworthit 

7 advertising network, with local evaluation based on the local profile information at a client 

8 to decide on an appropriate advert to display. 

9 The system is designed to support dynamic personalized advertising, with vendor- 

10 side competition for the right to present an advert to a buyer, and buyer side criteria for 

1 1 accepting or refusing an advert. A technical solution is presented to allow vendors to select 

12 an appropriate bidding-policy, to maximize revenue from advertising. We use client-side 

1 3 profiling and advertising selection to support targeted advertising with absolute guarantees 

1 4 on the privacy of a buyer. The ad-server sends a selection of adverts to a user, and the user's 

1 5 client chooses to display the most appropriate advert. 

1 6 With a limited capacity to display an advert to a user, it is necessary to control the 

17 adverts that vendors display. We propose an auction-based system to compute prices 

1 8 dynamically though competition between vendors. The auction is a second-price sealed-bid 

1 9 auction that encourages advertisers to bid truthfully without "gaming" the outcome of the 

20 auction. The advertising auction is implemented on a user's client machine. 

21 In overview, the following steps are performed to support targeted advertising to 

22 users: 

23 1 . Buyer hits the web page of a vendor with an iamworthit advertising link. 

24 2. Vendor pushes content to the buyer (possibly personalized). 

25 3. Vendor contacts iamworthit ad server, requests an advert and pushes any profile 

26 information about the user. 

27 4. The advertising server represents a number of companies that wish to advertise 

28 with appropriate users, and determines a short list of adverts to offer to the user's 

29 client based on profile information. 

30 5. The user's client receives the choice of adverts and accepts an advert based on the 

3 1 values of bids and the likely suitability of an advert to a user. 
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1 6. The client requests the advert from the ad server, and then displays the ad. 

2 Payment is collected for viewing the advert, and finally the user's response to the 

3 advert can be stored in the central SDI database for future tuning of advertising policies. 

4 Figure 21 illustrates the system. The novelty provided by the system of SDI is that: 

5 (a) advertisers can decide when to place adverts and how much to bid based on historical 

6 information stored in the central SDI data warehouse; (b) the final advert is selected 

7 dynamically at the client with local profile information stored about a user. The stored 

8 profile information about users stored within Secure Data Interchange allows vendors and 

9 client-side SDI proxies to estimate the probability that a user will respond to an 

1 0 advertisement, given its profile and records about how other users with similar profiles have 

1 1 previously responded. 

12 Vendors, represented within the advertising-network server have access to some 

13 information about a user, and can compute an expected value of placing an advert, based on 

14 the final purchases made by other users that were shown adverts. Meanwhile, client-side 

15 machines have a more complete picture of a user's profile, and can compute the expected 

16 value of viewing an advert based on the advert and the price offered by vendors. In general, 

17 as the acceptability of an advert increases vendors will wish to bid more, and users will 

18 accept the advert for less money. Competition between vendors drives the bid price up, and 

1 9 allows users to receive the value of information about their profile to advertisers. 

20 13.3.2.1 Example: Technique to compute the expected value of an advert. 

21 A collaborative filtering system can be used to predict the probability that a user 

22 with profile x will respond to an advert. The computation is based on historical information 

23 in the central SDI data warehouse for similar users to the advert, and also on the basis of 

24 historical information for similar users to similar advertisements if there is little information 

25 available about the actual advertisement. Information can be provided to advertisers 

26 anonymously in performing queries, and also randomized if necessary. 

27 The basic technique is to select a set of users that have been shown the advert, or a 

28 similar advert, and are close in profile attributes to the current buyer. Call this the "relevant 

29 set". Given this, the probability Hit(x, Ad) that a user with profile x will hit an advert with 

30 profile Ad can be computed as the ratio Hit(x) = NumJHit / Num, where NumHit is the 
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1 number of users in the "relevant set" that responded to the advert, and Num is the total 

2 number of users in the relevant set. 

3 Although subject to a certain level of necessary uncertainty, because buyer behavior 

4 cannot be predicted perfectly, ultimately it is useful to predict with as high an accuracy as 

5 possible whether a buyer will respond. Given a probability Hit(x, Ad) that an individual 

6 buyer will respond to an advert, a vendor can define a bidding policy. The policy maps 

7 Hit(x, Ad) to a value to bid for the right to target a buyer. 

8 Assume in this section that the profiling system places buyers into one of several 

9 "classes" of buyers, CI to Cn, given an advert with profile Ad, where each class has an 

10 associated hit rate, i.e. x e CI =^> Hit(x, Ad) = Hit(Cl, Ad). Furthermore, assume that the 

1 1 system also predicts the average amount spent by a user that hits on the advert, or the 

12 average value to the vendor from a hit, this can again depend on the class of buyer, and can 

1 3 be determined within a profiling system based on historical information about this advert, or 

14 about similar adverts. Let Rev(Cl, Ad) denote the value to a vendor with advert Ad of 

15 getting a hit with a buyer in class CI. Now, the vendor can determine its expected value for 

16 placing an advert to a user in each class, for example using the computation Val(Cl, Ad) = 

17 Hit(Cl,Ad) * Rev(Cl,Ad). 

1 8 The system of vendor-side advert competition works as follows. Once a user hits a 

19 web page with an iamworthit-targeted advert, a choice of adverts are pushed to the user's 

20 client from the ad server. Each advert is represented as a three-tuple: 

21 ( link to location of advert, bidding function Bid(hit, rev) , profiling function 

22 Profile(x, Ad), value function Rev(x, Ad) ) 
23 

24 The profiling function Hit(x) = Profile(x, Ad) is used to place a buyer into the 

25 appropriate class of buyers, and compute the hit probability for a buyer, Hit(x) = Hit(x, Ad) 

26 = Hit(C, Ad), where x is the profile for the buyer, C is the class that the buyer's profile 

27 places it within, and Hit(C 9 Ad) is the probability that a buyer in the class will respond to 

28 the advert. A possible implementation of the profiling function is to encode it using 

29 prototypical cluster centers for a buyer in each of the set of buyer class types, and then 

30 assign the buyer with profile x to the cluster that is closest (in some well-defined metric) to 

3 1 the cluster type. The metric does not need to be linear in each of the dimensions of a user 
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1 profile, and in particular would be expected to be place more weight on terms that are 

2 important to the hit rate in a particular class of buyers. 

3 As described, this method has the following useful characteristics: 

4 a) the hit probability for a buyer is computed at the client machine, using the profile 

5 that is stored at the client for the buyer. The client machine does not need to 

6 release the user's profile, and the user's privacy is assured. 

7 b) The information necessary to compute the hit probability can be encoded in space 

8 linear in the number of clusters, which is much more efficient that passing 

9 information about every relevant user profile to the client machine. 
10 

1 1 Without computational/communication restrictions one might pass historical 

12 information in the form of (Profile, Hit/Miss) pairs for users that have been shown the 

1 3 advert, or shown a similar advert. A nearest-neighbor algorithm could then be used to find 

14 the relevant set of profiles for a new profile x and compute the expected hit-rate from the 

1 5 ratio of users with similar profiles that historically hit/missed the advert. 

16 The cluster centers approximate this solution, representing the average hit-rate of 

17 "close" profiles so that the client machine can simply locate the closest cluster center and 

1 8 use that as a proxy for the probability that its user will hit the advert. 

1 9 A similar technique can be used to compute the expected revenue from a buyer with 

20 profile x that hits an advert. In fact, this information can be computed using the same 

2 1 method, simply by associating an expected revenue with each cluster center. 

22 1 3.3.2.2 Client-side Advertising Reverse Auction 

23 The client implements an auction for the right to show an advert to a user. The 

24 auction is a Vickrey auction between all the adverts that are passed by the ad server to the 

25 client. The Vickrey auction (Vickrey, 1961) is a second-price sealed-bid auction. Given 

26 bids bl, b2, . , bn the auction sells the right to show an advert to a buyer to the vendor that 

27 bids the highest value for the value of the second-highest bid. The Vickrey auction is useful 

28 because it is truth-revealing. The optimal strategy of each vendor is to bid its true value for 

29 showing an advert to a user. For example, with a profit-margin of 20%, it is optimal for a 

30 vendor to submit bid = 0.2 * hit * rev. If accepted, it will pay at most bid, and it will pay 
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1 only enough to shave the bid of its nearest competitor. The auction is sealed, so competitors 

2 do not ever see the information in failed bids. Cryptographic techniques can also be applied 

3 to ensure that the bids are not inflated by the auctioneer. 

4 A user can also define an acceptance function in the auction, which represents her 

5 reservation price to view a particular advert. Assume that the reservation price depends on 

6 the hit rate hit of the advert, and represent the price as Accept(hit). 

7 The client-side advert auction runs as follows: 

8 1. For every advert Ad, Compute the hit rate of advert Ad, given the profiling 

9 function Profile(x, Ad) and the buyer's profile x, and the expected revenue from 

1 0 showing the advert to the user, rev. 

11 2. For every advert Ad, with hit rate hit = Hit(x, Ad), compute the acceptance level 

12 accept = Accept(Hit) for the buyer and the bid for the vendor with the advert, bid 

13 = Bid(hit, rev). If accept > bid then reject the bid for advert. If no adverts 

14 remain, jump to step 7 (in this case no advert is shown to the user). 

15 3. Given the bids that are not rejected by the client, sort them in order of increasing 

1 6 value, and insert a "buy" bid from the buyer equal to the value of its acceptance 

1 7 level for the advert with the bid of highest value. 

18 4. Accept the advert with the greatest bid, and charge the vendor the price of the 

1 9 second-highest bid. 

20 5. Fetch the graphics and URL for the advert, and display the advert to the user. 

21 6. The client monitors the actions of the buyer, and records (to be later transmitted to 

22 the iamworthit server) whether the buyer responds to the advert. 
23 

24 The buy bid in Step 3 will be less than the bid for that advert, because the advert 

25 was not rejected in step 2. However, this buy bid might be greater than the value of the 

26 second-highest bid for an advert, and is required to make sure that the price paid by the 

27 vendor that wins the auction is greater than the buyer's acceptance level. We make sure that 

28 the user cannot cheat by bidding just below the highest bid received by requiring that the 

29 user states his/her reservation value before the value of bids are revealed. 
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1 A more general system for dynamic customized advertising might allow a vendor to 

2 specify a bid for each type of buyer, and also a maximum budget, so that the vendor that 

3 places adverts can maintain control over its spending. 

4 We might also allow users to specify in their advert acceptance policy how many 

5 adverts they are prepared to receive a day, so that they are not inundated with too many 

6 adverts, even if they receive financial compensation. 

7 13.3.2.3 Numerical Example 

8 The advert auction message from the ad server arrives with a choice of three adverts, 

9 Adl, Ad2, and Ad3. Each advert is associated with a profiling function, a bidding function, 

10 and a value function. The client machine computes the hit rate for each advert, based on the 

1 1 profiling functions and its local profile for the buyer. Suppose hitl = 30%, hit2 = 5% and 

12 hit3 = 15%. The client machine also computes the expected revenue if the buyer hits an 

13 advert, revl = $2, rev2 = $8, rev3 = $3. The hit rate and revenue are used to compute bids 

14 for each advert, using the bidding function. Suppose that the bids are bidl = $0.50, bid2 = 

15 $0.70 and bid3 = $0.40. 

1 6 Now, the client also computes the acceptance level for each advert, based on the hit 

17 rates predicted within the iamworthit system. Suppose acceptl = $0.30, accept2 = $1.00 

18 and accept3 = $0.30. The bid for advert 2 is rejected because it is below the accept value. 

19 The bids for adverts 1 and 3 are accepted. Now, the auction is constructed with bids bidl 

20 and bid3, and acceptl because that is the accept-value for the bid with the highest value that 

21 is not rejected. The auction takes the bids (0.50, 0.40, 0.30), and sells the right to advertise to 

22 the user to vendor 1 for $0.40 (the value of the bid from vendor 3). 

23 Finally, the client fetches the graphics and URL link information for the advert from 

24 vendor 1 , and transfers payment from the vendor to the user for the right to show the advert. 

25 13.3.2.4 Tuning an Advertising Strategy 

26 Although the optimal strategy for a bidding agent in a single Vickrey auction is to 

27 reveal its true value for the right to show an advert to the user, the vendors are bidding over 

28 a number of different auctions, and might have a limited advertising budget. 
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1 Selecting appropriate users. 

2 One good approach is to start with a reasonable policy and then adjust it 

3 dynamically, based on feedback received from adverts using techniques from reinforcement 

4 learning (see [RN 97] for an introduction). Offline simulation with methods such as Monte 

5 Carlo simulations can provide good initial policies, based on information about a simulated 

6 population of users, which could be provided anonymously by the system of Secure Data 

7 Interchange. Users can be provided with incentives to reveal information anonymously 

8 about their acceptance functions, to allow this type of offline modeling. 

9 A vendor can tune an advertising policy with a random sample of users, simulating 

10 the auction that runs on a user's client machine. Metrics such as the average number of 

1 1 times that an advert is shown to a user of each type, and the average amount that the vendor 

12 pays to show the advert provide information to allow optimization. For example, the 

1 3 analysis might show that although a particular class of buyers are most likely to hit an 

14 advert, there is also a lot of competition to show adverts to buyers in that class, and the 

1 5 average cost to show an advert is high. In this case a vendor can conclude that it is more 

16 cost-effective to advertise to buyers in cheaper but less relevant classes. The analysis can 

1 7 also be used to check that the average buyer is not setting an acceptance level higher than 

18 the vendor's bid, which again would indicate that the policy should target a different set of 

19 buyers. 

20 Selecting appropriate adverts. 

21 The system that we have described can be extended to allow a vendor to select an 

22 advert to display to a user based on the type of profile of a user. For example, a vendor can 

23 pay for the right to be one of N vendors that compete in an auction for the right to advertise 

24 to users that hit the web page of a particular vendor. We can allow a vendor to not only tune 

25 its bid to the profile of a particular user that hits a web page, so that only users with a good 

26 fit with the service offered receive the web page, but we can also allow a vendor to tune the 

27 advert that it shows to a user. 

28 We allow a vendor to submit a number of different adverts and bid functions via the 

29 advertising network server. The client-side auction can be expanded to allow multiple bids 

30 from each vendor, where each vendor can submit a number of bids, but only pays the second 

3 1 highest price bid from another vendor. Again, the auction is truth-revealing for a vendor. 
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1 1 3.3.2.5 Automatic Advert-Replacement Systems 

2 Although there are potential copyright violations in any system which alters the 

3 content of information published by a vendor before displaying that information to a user, 

4 we comment that the above described system can be applied even with vendors web pages 

5 that do not subscribe to the advertising network in SDL Adverts can either be replaced, or 

6 added in separate windows. This enables any user that subscribes to SDI to receive 

7 personalized adverts and revenue from advertisers, irrespective of whether or not pages are 

8 within the SDI network. 

9 The key problem in ad replacement is to identify an advertisement within a page, i.e. 

10 information that is not related to the core purpose of the page. At present adverts may take 

11 one of two forms: 

12 (a) They can be statically, or dynamically, generated at the web server of the web 

1 3 page that the user hits, and pushed to the user directly at the same time as the 

14 general content. 

1 5 (b) They can be pushed to the user in parallel with the other non-advertising data on a 

16 web page, for example with a link to an advertising network. 
17 

1 8 The adverts in (b) can be identified by tracking ad server URLs that are embedded in 

1 9 the source code of web pages. Case (a) is more difficult because there is no identifying URL 

20 to indicate the content of the section of a web page. However, there are solutions to advert 

21 identification in these cases, for example via regular expression identification and the 

22 location of ad banners, as described in www.junkbusters.com and the "block files" that 

23 allow a proxy server at junk busters to strip adverts from pages. Another technique is to 

24 identity URL's that allow a user to click to another domain from within the current page. 

25 These might likely be adverts, consider for example, a link to www.buyanewcar.com on the 

26 New York Times web page. 

27 Another variation applies to television commercials and other broadcast medium, 

28 especially as the data storage capacity of local set top boxes increases. For example, a code 

29 is transmitted by national networks to cue local adverts from local network stations, 

30 allowing the replacement by personalized adverts from advertisers in the SDI ad network. In 

3 1 the future it is also likely that we will use digital storage devices to receive entertainment 
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1 and television programs that are broadcast, storing the information for future viewing, or 

2 even slightly delayed "on demand" viewing. With digital media and storage, it is possible to 

3 replace adverts inserted into programs by the producers with SDI targeted adverts, either 

4 with or without the cooperation of the source of the information. 

5 1 3.3 Additional Applications 

6 1 . Use of Real Time and Anticipated Vehicle Location Data to Provide Real Time 

7 Traffic Reporting and Predicted Traffic/Congestion Modeling 



8 

9 a. Real Time Traffic Reporting 
10 

1 1 A simple but useful application of LEIA involves on an opt-in basis real time 

12 transmission of a sample of driver's location information to a regional traffic- 

13 reporting bureau. Though existing prior art approaches are reasonably efficient, 

14 there are common circumstances in which congestion may have just occurred 

15 (e.g., following an accident) or wherein congestion has recently cleared up and 

16 such up to the minute information is not available to the traffic bureau. Moreover, 

1 7 this approach could be extremely advantageous in providing optimal navigational 

18 assistance for drivers wishing to optimally reduce driving time to a particular 

19 destination, which she/he submits to the system. The present system would 

20 consider the comparative length of different routes from the user's present 

21 location to the destination in combination with the anticipated average speed 

22 based on that of other vehicles traveling those particular routes. This enables real 

23 time recommendations to be presented to the user, based on this real time data. 
24 

25 b. Predictive Modeling of Traffic Patterns Based upon Real Time Location Data 

26 from Vehicles 
27 

28 It is possible to apply certain statistical techniques in order to predictively 

29 anticipate traffic patterns for some marginal period of time into the future based 

30 upon real time location data regarding the present case sample of vehicles. One 
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1 may consider recent past traffic patterns compared to present traffic patterns (i.e., 

2 change in speed as a function of time by segment of highway as well as absolute 

3 and rate of change in traffic volume). There are pre-existing models which are 

4 able to anticipate the corresponding effects of congestion at the moment in which 

5 it begins to occur which utilize these variables. This model must also consider the 

6 rate of change in average speed over each segment as well as the predicted impact 

7 of changes in traffic volume from feeder highways based upon changes in volume 

8 occurring dynamically thereupon. It is a key objective of the navigational 

9 assistance feature to utilize future predictive traffic models, in order to optimize 

10 the reduction in transit time for the most number of users possible, i.e., 

1 1 considering the volume of traffic which is predicted on a segment by segment 

12 basis in combination with the traffic models for the impact of these volumes upon 

1 3 transit time (including the rate of increase thereof). With this data, the system can 

14 determine, in advance, on an individual driver basis, which route for a given 

15 requested destination, will provide the shortest drive time for the user, while 

16 insuring that a near optimal transit time is achieved, across all segments of the 

1 7 highway system, as a result of each individual system recommendation to each 

18 driver. For example, an anticipated compounding congestion problem would 

19 warrant the vast majority of vehicles to take alternate routes until the congestion 

20 or predicted congestion is alleviated in that segment. 
21 

22 2. Use of LEIA for Creating Traffic Models for Purposes of Mapping Commercial 

23 Industrial and Residential Real Estate Market Opportunities - 

24 A very useful application of LEIA involves the collection of traffic pattern data 

25 on a time-specific basis (the day of the week and time of day) for the traffic, as it passes 

26 each piece of real estate. An electronic map which is ideally Web based (potentially 

27 nationwide is generated and constantly updated based upon this data. Additional 

28 information may be provided which may include (but is not limited to): 
29 

30 a. Origin and destination information of the vehicular traffic (as captured by 

31 LEIA) which may, especially if correlated with time, suggest the nature and 
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1 context of the driver's activities, e.g., rush hour traffic, errand traffic, etc. It may 

2 be useful to factor in the type of neighborhood the vehicle returns to every night, 

3 the type of commercial or business entity she/he drives to work to each day, etc. 
4 

5 b. Other activity-related clues which the user is willing to release, e.g., devices 

6 interacted with, content interacted with or transmitted information, etc., which 

7 may provide insights into the mind-set of which users tend to experience when in 

8 the vicinity of the real estate property. 
9 

10 c. User Profile Data - Aggregate purchase and content affinities as well as price 

1 1 elasticity data (gleaned from purchase statistics) could be very useful information 

12 to commercial real estate developers and purveyors. Users with the right profile 

1 3 and a receptive mindset are of particular interest. 
14 

15 It should be noted that the present system may be extended to residential real 

16 estate. E.g., what types of jobs (such as quality of jobs) do local commuters have? What 

17 are their numbers? How far do they commute (particularly if they tend to commute 

18 further than the present real estate site)? Do their commuting routes tend to pass the 

19 current potential real estate site? The last three questions would also be particularly 

20 relevant as well for a prospective industrial real estate development opportunity. 
21 

22 Industrial real estate developers also may be interested in mapped models of real estate 

23 depicting the professional and known likely educational characteristics of the associated 



24 local residents in that region? What are the other businesses at which they work? (If 

25 available) what are their particular positions/responsibilities? 
26 



27 3. Use of LEIA for Creating - Traffic and User Profile Models of Traffic Passing 

28 Billboard Sites and Providing a Map of Such Information on an Available Billboard Site 

29 Basis - 

30 The presently described techniques for providing dynamically updated 

3 1 informational maps containing detailed statistical data regarding vehicular traffic passing 
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1 real estate sites can be further extended to similar maps of interest to advertisers which 

2 contain locations for available highway billboards. The present system further provides 

3 Web-based access, which enables advertisers to make reservations and purchases of such 

4 billboards. In one preferred variation, an economic model is deployed to optimally price 

5 the billboards. I.e., a varied representative sample of each type of billboard sharing 

6 similar traffic/user profile features with others is auctioned for this purpose. In another 

7 variation this on-line auction model is deployed for all billboards available by the system 

8 in order to provide a novel service to advertisers which is a "billboard auction" site. 
9 

10 4. Use of LEIA for Enabling Drivers to Identify and View Sales Opportunities and 

11 Offers Associated with Physical Objects in the Surrounding Environment Utilizing 

1 2 Heads-Up Display Technology 

13 The present system is an extension of the virtual tag methodology in which 

14 physical objects contain meta-data and in which the location of such objects with respect 

1 5 to the user is determinable either by transmission of a signal, which contains the exact 

16 physical coordinates of the object or a scheme by which this information may be pre- 

17 loaded, which is typically from a server, which contains such information (thus "non- 
18 wired" objects may be identified accordingly) to the user's device, e.g., as she/he 

19 physically travels in the vicinity of such items. In either case, the present physical 

20 location of the object must be known in addition to that of the user (in the case of fixed 

21 objects, of course, the dynamic location tracking techniques are unnecessary). A primary 

22 commercial application of the present scheme applies to purchasable items. The user 

23 may submit his/her interest profile to SDI for purposes of being dynamically notified and 

24 disclosed of purchasables which match his/her user profile or specifically requested 

25 search criteria. The metadata may include a brief description and any additional level of 

26 information which the seller may be willing to disclose to that particular user. The seller 

27 may possibly request details regarding price elasticity of the buyer (purchase appetite) 

28 before quitting any discounts to the existing price. Prime example applications of the 

29 present scheme may include: 
30 
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1 1 ...Revealing used car sales opportunities to drivers (by actually flagging relevant 

2 vehicles which the driver passes (or which pass the driver); 
3 

4 2. Real estate and home sales opportunities (as well as apartment and commercial real 

5 estate rental/leasing opportunities; 
6 

7 3. User-user introductions in which one or both users may be extremely interested in the 

8 other based upon matching criteria within their associated social, information and/or 

9 professional desires and assets. 
10 

1 1 The preferred underlying technology for the present heads-up display system involves a 

12 technique for tracking the orientation of the user's head as well as the direction of his/her 

13 pupils (it may be possible, however, to implement the system by detecting pupillary 

14 direction based solely upon the position of the pupils relative to one another, using a 3-D 

15 optical tracking device. The iris of the user may also be utilized as a bio-metric identity 

16 of the particular user (see issued patents assigned to IrisScan Corp.) In the preferred 

17 implementation, the head-up projection device projects the virtual flagger upon the 

18 portion of the windshield or window(s) which is in direct line of site between the user's 

1 9 pupils and the physical object of interest. The technique may utilize heads-up windshield 

20 display technology and/or remote projection (e.g., for projection upon the side windows). 
21 

22 In another more advanced version of the system, the holographic projection system using 

23 micro-mechanical techniques may project the information directly upon the user's retina. 

24 This technique is detailed in co-pending patent application entitled "Remote Retinal 

25 Imaging Projection System" The variety of other potential applications of the present 

26 technologies are also conceivable which include: 
27 

28 SYSTEM FOR PROVIDING REMOTE ACCESS AND MODIFICATION 

29 CAPABILITY OF DOCUMENTS AND E-MAIL BY A SENDER 
30 
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1 The present system involves the use of a technique which enables the sender of a 

2 document, upon permission of the recipient, to provide subsequent updates and revisions 

3 to that document remotely and automatically in the absence of the recipient. Utilizing the 

4 computer's modem and e-mail delivery system, these updates could be provided remotely 

5 by the original sender (who alone can gain appropriate access privileges through a unique 

6 access code or authentication). The sender may also be able to identify whether or not 

7 the old version had been accessed by the recipient and possibly even at the level of the 

8 particular segment, or segments in which the specific changes had been made. (This 

9 capability could also be two way in which certification of not only receipt, but also access 

10 particular portions of the information by the intended recipient could be positively 

1 1 confirmed by the sender however, subject to approval by the recipient). If the recipient is 

12 amenable, such receipt confirmation could also be automatically time stamped by a 

13 secure trusted agent on the recipient's machine. The system could also perform the 

14 appropriate modifications in different drives on which the document is stored. If the 

15 appropriate modification was performed on the hard drive, but not a copy on the 

16 floppy(s), the recipient could be notified and prompted to insert the floppy(s) for 

1 7 appropriate updating until all versions of the original document are updated. It should be 

18 noted that such latter functionality would even be of independent utility to standard 

19 modifications as performed with word processing systems. The above same features 

20 could also be provided for e-mail as well (in a variation, time stamped receipt could also 

21 be applied to voice mail as well). The recipient if he/she had not approved the remote 

22 modification in advance, the sender could request the access/modification privilege of the 

23 recipient via e-mail or instant messaging service for example. The recipient may also be 

24 able to identify a digital time stamp for the relevant portion(s) of the document (or e- 

25 mail) as modified/updated by the sender. In a variation, XML metadata containing 

26 digital time stamps could also be utilized for the standard modifications, which occur 

27 throughout the document, i.e., the nature of the modification, i.e., the addition, change or 

28 deletion, the time it was performed, on which machine, under which password and if 

29 relevant by which remote user. Outside of the context of the present remote modification 

30 system this function could be integrated into a standard word processing system with 
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1 standard comparison check function. Such digital time stamps would help in proving 

2 identity and date of authorship at a detailed level. 

3 

4 In a variation of the present system, a user could also receive automatic notification 

5 if/when a recipient has accessed (or provided an associated modification to) a document, 

6 portion thereof, e-mail or voice mail. This would require the recipient to provide 

7 approval of the above privileges for the sender in response to a request prompt. In 

8 another variation of the present system, the provider of a URL may receive time-stamped 

9 verification of a user having accessed a URL (which may request a uniquely identifying 

1 0 access code or other authentication or the Web page may be accessed by that user who is 

1 1 provided with a completely unique URL, thus verifying access by that user for remote 

12 access, modification and associated time stamping. If the URL is not under control of the 

13 requestor of the verification the accessor' s client or Web-centric script (associated with 

14 his/her access account) could notify him accordingly with similar time-stamped 

15 verification of access. In another related application of the present capability, the 

16 accessor' s script could carry-out certified verification of more diverse or complex actions 

17 by a desired accessor, for example: 
18 

19 1 . Verify (and if desired notify) the requester of the accessor having sent or received and 

20 read document (or e-mail) with content X to or from recipient or sender Y. For 

21 example, knowing that recipient had read a message or document sent by the sender 

22 (or another sender) or otherwise accessed may prompt requestor to take the next 

23 action or communicate with recipient. 
24 

25 Or verify (and if desired notify) the requestor (e.g., an employer) of the accessor (e.g., 

26 an employee) having made telephone contact with X individual(s) or phone 

27 number(s) at a certain time(s) (or within a certain time frame and/or containing 

28 certain message-type or content). Or the content (contextual profile) of the 

29 conversation(s) may be revealed or confirmed as being within a target contextual 

30 domain(s). 
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1 2. Verify (and if desired, notify) the requestor (e.g., an employer) of the accessor (e.g., 

2 an employee) having performed certain definable and/or desirable volume of on-line 

3 tasks, e.g., typed certain type(s) of content, performed certain accounting, billing 

4 (telephone handling courteous) prompt or other administrative functions. Knowledge 

5 by requestor (e.g., via notification) may then prompt requestor to take a subsequent 

6 action (or e.g., communicate again with accessor). 
7 

8 3. For any of the above notify requestor (e.g., employer) of the failure of the accessor to 

9 perform certain desired (and/or volume) of tasks with pre-defined threshold 
10 parameters. 

11 

12 4. Verify and, if desired, notify requestor (e.g., teacher or parent) of accessor (e.g., 

13 student or child respectively) of certain on-line content consumed, e.g., Web pages, 

14 paragraph by paragraph content (including wireless e-books) or on-line homework, 

1 5 such as quizzes, essays, reports, in conjunction with or independent from such on-line 

1 6 content. 
17 

18 5. Verify and, if desired, notify requestor (e.g., employer or teacher/parent) of accessor 

19 (employee or student/child respectively) of accessing, generating or conversing 

20 (acoustically/telephonically or textually) certain content which falls outside of a 

21 certain defined set of content clusters which are deemed "appropriate" within certain 

22 time periods (e.g., work-time such as when the user is "clocked-in", homework time, 

23 etc.) or in general. And/or verify or notify accessor of certain clusters, which are 

24 accessed or generated within certain content clusters. The above cluster based 

25 identification rules may also be definable by not only time of day or day of week, but 

26 domain, e.g., e-mails and phone conversations of a "social" context or style are 

27 inappropriate while pornography is inappropriate during work time for employees or 

28 anytime for students or children. The present "conversation" based variation, may 

29 even be implemented in conjunction with a passively collected acoustic monitoring 

30 via the client's microphone of an employee's, student's or child's spoken 

3 1 conversations in/around his/her (or a) client machine. Identity may be determined in 
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1 conjunction with associated voice print information and bias results to documents, 

2 which match that style in accordance with the appropriate weighting criterion 

3 matching the user profile. 
4 

5 6. Give the user a set of selection criteria to bias the search (or recommended 

6 navigational links) by those styles, which s/he desires presently. 
7 

8 In addition, the present system could further actually use style as an additional means for 

9 identifying documents, which may be relevant to a search, e.g., determine what style(s) a 

10 preponderance of document in the present search tend to possess, then add this criteria as 

11 an additional weighting criteria to determining document relevance (or finding new 

12 potentially relevant documents). I.e., as part of the information retrieval (attribute 

13 cluster) model, use "style" as an additional attribute for and similarity to other documents 

14 and queries. 
15 

16 System for Mining User Reactions and Responses to On-line Media by User Profile 

1 7 Characteristics: 
18 

19 As described, are some primary examples of how the present technique can usefully 

20 be deployed. User profiling may also be usefully applied for purposes of collecting feedback 

21 about various types of user reactions and responses to various on-line stimuli (e.g. general 

22 Web and e-commerce sites to various products such as movies, music, interactive content, 

23 advertising news, interactions with other users etc. A somewhat related methodology was 

24 disclosed in the parent patent involving a technique for allowing users to rate vendors 

25 according to a variety of criteria and enabling the vendors (e.g. for marketing purposes) 

26 and/or users to observe statistical correlations between the user profile characteristics and 

27 ratings (using data mining techniques). In a simple example, application of the present 

28 methodology, the users viewing a Web page (containing any of the above suggested 

29 informational types) are allowed to provide their personal comments about the page or about 

30 their own experience in viewing the page. Informational retrieval and statistical NLP 

3 1 techniques may be used to cluster (using standard clustering techniques) both the comments 
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1 (by similarity of their content profiles) as well as the users (by similarity of their user 

2 profiles). In the first instance, the comment cluster exemplars are extracted, used as the 

3 comments which most closely exemplify each of their associated clusters and thus presented 

4 to vendors and/or users. The aggregate (average) of the user profiles associated with that 

5 cluster, the user profile of the user who provided the exemplar comment or the subset of 

6 user profiles which characterize subclusters of the comment clusters (wherein the 

7 subclustering routine is based upon the profiles of users within each comment cluster) are 

8 also presented to the users or associated vendors in conjunction with the exemplary 

9 comment most closely associated with that stereotypical profile of users. In a variation, 

1 0 which could be an automatically selected alternative if the exemplar fails to provide a clear 

1 1 representation of the various comments in the cluster (if further subclustering does not 

1 2 adequately achieve this objective), it may be possible to "combine" these various comments 

13 which are closest to the clusters' centroid by identifying those portions of the comments 

14 which are similar in meaning and those portions which are different. For similar portions, 

1 5 by preferentially utilizing the counterparts which are closest to the centroid and adding to it 

16 those portions which are different in piecemeal fashion, the desired objective may be 

17 achieved. 

18 In a variation, the different types of comments may be clustered and accessible to 

19 users via a hierarchical cluster tree used to create a menu of automatically labelled clusters 

20 (see parent patent detailing this general method). Users may access such menus by 

2 1 combining querying (of user or content attribute with menu navigation). 

22 It is of course reasonable to combine the technique in the parent patent application 

23 for statistically correlating user profile attributes with ratings associated with the users 

24 possessing those attributes. 

25 It is also reasonable to apply the present technique to video (or streaming video 

26 content) whereby during the course of viewing the content relevant survey questions are 

27 presented to users and if desired, the correlation statistics of the results with the associated 

28 user attributes are presented to the content owner and/or (preferably) also future viewers. In 

29 a novel and advanced variation, time shifted video technology would enable the ability to 

30 capture audio or full video/audio versions of the comments of the user. 
31 
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1 In a very novel application, it is possible upon permission of the associated users, to 

2 automatically construct menu trees (in accordance with the above method as disclosed in the 

3 parent case) of live voice or text chat in real-time dynamic fashion. Spoken conversations 

4 may be automatically profiled using acoustic speech-to-text methods. In accordance with 

5 the methods for automatic construction of virtual communities (in this case for dynamic 

6 chat) as disclosed in the parent case, these communities can be constructed automatically 

7 and on-the-fly based upon similarities of chat content, queries, navigational (content) 

8 selection from automatically constructed menu and/or user profile similarity or attributes of 

9 the profiles of the users which may be selected. Thus, in the former case a user could in the 

1 0 case of a pull down menu gain access by title of key phrase (which are ideally automatically 

1 1 and dynamically created and updated in accordance with the content of dynamic spoken or 

12 recorded conversation spoken or typed in real time and which may be further dynamically 

1 3 categorized and re-categorized within a hierarchical menu structure format. Although, the 

14 present system is implemented specifically for Web sites, a generalized user interface 

1 5 approach is certainly a very appropriate use of the present methodology. 
16 

1 7 Thus, in the former case a user could in the case of a pull down menu gain access by 

1 8 title or key phrase (which are ideally automatically and dynamically created and updated in 

1 9 accordance with the content of dynamic spoken or recorded conversation spoken or typed in 

20 real time from potentially any other user and which may be further dynamically categorized 

2 1 and re-categorized within a hierarchical menu structure format. Of course, these other users 

22 could, if desired, also specify constraints, e.g., by profile attribute of the user (accessor). If a 

23 chat dialog of a particular type of subject is scheduled in advance, the user could also be 

24 made aware of such posting if his/her profile so allows. 

25 The user could even select key terms or phrases which are most highly predicted to 

26 be of interest/relevance to the user based upon his/her profile. In any event, the queries 

27 could also be provided as persistent queries this enabling the user to be notified while on- 

28 line (e.g., while on his/her lap-top, PDA, television, or telephone whenever highly relevant 

29 dialogs appear (or even off-line if designated to be important enough, e.g., via a phone, 

30 pager, in an automobile, etc.). It is presumed that many of the interfaces to these devices 

31 will be speech enabled, thus speech-to-text techniques for purposes of the present 
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1 application are likely to become increasingly pervasive. The issued patent, "System for 

2 Customized Electronic Identification of Desirable Objects" further offices a technique 

3 which uses similarity measures for profiling users according to their proficiency knowledge 

4 in various informational domains by analyzing their ability to provide intelligent responses 

5 to questions of almost any sort and informational domain presented to them by other 

6 inquirying users. 

7 Accordingly, it is reasonable to also suggest to users functionality which allows 

8 them to be able to be notified of and/or receive messages, e.g., such as via an instant 

9 messenger service or a voice-enabled version thereof, in which potential recipients are able 

10 to select certain types of content as manually entered or selected via the menu tree which 

1 1 they would be interested in being made aware of or to be notified if7when individuals 

12 possessing attributes and, which may include proficiency attributes by topic, sub-topic or 

13 key word(s)/phrase(s)(again, so long as the user's profile is within the disclosure policy of 

14 the transmitting user. If a willing recipient so desires for both the chat and instant 

1 5 messaging variations, s/he may receive a copy of the text or voice transmission or if the user 

16 is not able to access it in real-time or otherwise. Conversely, the user may as a sender wish 

1 7 to sent the transmission to 
18 

19 1. All users who have indicated potential interest in the contents (and/or profile or identity 

20 of the sender) or users who match a particular user profile (e.g., interest or proficiency such 

2 1 as which is of measured similarity to the contents of the transmission) and/or 
22 

23 2. Users who are in the process of engaging in a user-user (or multi-user) dialog, which is of 

24 measured similarity to that of the transmission. 

25 An example application of the present system includes, for TV viewers, the ability to 

26 gain access to different types of feedback from other users who are also presently viewing or 

27 had previously viewed a particular TV program, e. g., a comedy, a news story or political 

28 speech in which the present technique could be used effectively as a filter allowing certain 

29 types of user or user attributes to reach the user and others to be suppressed or squelched. In 

30 the latter example, a user who considers herself to be a female liberal may provide settings 

3 1 during a campaign speech by a conservative Republican speaking out against abortion to 
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1 listen to acoustic/verbal gestures by other liberal females advocating abortion). Audible 

2 comments may be unfiltered and heard by the user, if desired. 
3 

4 If the program is a re-broadcast, more elaborate filtering capabilities are conceivable 

5 involving statistical analysis of the spoken language content (such could be conceivably 

6 performed for real time live information, however, at the expense of a slight delay). As 

7 suggested, it may be also possible to observe textually (e.g., through speech to text 

8 techniques) or hear spoken conversations as they exist between individuals possessing 

9 desired attributes, containing content characterized by key words or phrases from a menu 

1 0 selection or provided by^etween a desired individual or individuals respectively. 

1 1 In another application, users viewing video or streaming video content may upon 

1 2 their permission, agree to disclose their user profiles and be acoustically monitored such that 

13 comments, verbal, verbal/audible gestures and expressions and/or video of Effected thereby 

1 4 may be of interest to users 

1 5 In conjunction with an educational program a user may wish to observe comments 

16 by those other individuals who are extremely knowledgeable in the field (to collect 

17 supplemental information and other feed back about the presently viewed content). 

18 Similarly, in the political speech example the user may desire to hear the gestures and/or 

19 spoken comments, reactions and/or brief dialogues occurring during the course of the 

20 political speech by individuals sharing the same political views as the user and which are 

21 considered to be part of the educated elite. In another variation, individuals who 

22 represent the exemplars of different cluster of users (and/or those which are manually 

23 selected as providing interesting and/or entertaining feedback) could be heard by the user 

24 collectively and the reactions expressed as acoustic feedback could be statistically 

25 analyzed in dynamic fashion and presented to the user as a breakdown of the user 

26 attributes which presently characterize the present reaction or response of the virtual 

27 audience. 

28 In a future application, such system could be extended to such things as emersive 

29 virtual reality (stationary or non-stationary) systems and/or video gaining systems in 

30 which the reactions of characters (or even different "personals") to various situations and 
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1 events could be developed from data collected from the reactions of actual users 

2 comprising the different user clusters as above suggested. 
3 
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1 Applications to Constructing User Profiles and Matching Users by Similarities in Their 

2 Social and Psychological Profiles and/or Life Circumstances and Experiences 
3 

4 There is an untapped opportunity with potentially considerable and deep 

5 implications which could be used to enable very detailed assessment and associated 

6 profiling of individuals. The parent patent describes a very comprehensive methodology 

7 by which users could potentially be profiled so as to define their preferences across 

8 almost any kind of content or commercial products and services. Surveys used to reveal 

9 psychological traits are also used. In the present extension, we first suggest a primary 

1 0 system methodology in which extremely detailed information is collected and aggregated 

11 into the user profile particularly pertaining to the user" psychological profile (user 

12 preference/interest information demographics, etc. are also useful data in that there are 

13 likely correlations which exist with certain psychological attributes of the user). In the 

14 clinical field of psychoanalysis, much information regarding the psychological and 

15 psycho-pathological characteristics of the user is determined, however in compliance 

16 with the confidentiality requirements of the doctor-patient privilege, most if not all of this 

17 information remains isolated and never accessible by a large-scale statistical database for 

18 purposes of cross-correlation of psychological characteristics, phenomena, psycho- 

19 pathology (as well as other clinical pathology), as well as behavior, interests, preferences 

20 and more specifically identifiable behavior patterns as well as notable life experiences. 

21 Part of the answer to this dilemma may well lie in the application of secure data 

22 disclosure to SDI, the pseudonym proxy server and (for extra security) randomized 

23 aggregates for purposes of harvesting the statistical value within the data and (if needed), 

24 interacting with the user while maintaining completely secure and private individual user 

25 data. The present system suggests that there may be much greater accuracy which could 

26 be achieved in the profiling of individuals for use in a number of applications such as the 

27 psychological modeling of patients for use in the diagnosis, treatment, prediction 

28 (including predictive treatment) of psycho-pathology, the matching of "similar" patients 

29 together (for group therapy and/or pseudonymous electronic or physical mail 

30 correspondences) for providing support, inter-patient communication, counseling, etc. 

31 (for which present traumatic life experiences, if relevant, may be quite applicable as 
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1 well)., as well as the extended/improved modeling/prediction of user interests/preferences 

2 enabled by SDL 

3 

4 Another correlated data type could be employee file databases through which 

5 correlations and predictions of employee behavior and performance, recommendation as 

6 to remedies for existing problems and performance enhancing tactics again could be 

7 achieved. Unfortunately not all data is likely to be accessible for all users (e.g., 

8 browsing/transaction histories are likely to be available but often not information 

9 regarding a user's life experiences or psychological history). To fill in this missing data, 

10 rapid profiling techniques could be applied (see parent patent). Key aspects of a user's 

11 psychological profile for example, could be acquired, anything from their personal 

12 experience, beliefs, fantasies, habits, personality traits social behavior, etc., for 

13 preventative life style patterns. In an obvious example application, an individual could 

14 be very accurately profiled, psychologically in order to predictively determine likely 

15 strengths and weaknesses in his/her psychological and psycho-social aspects and thus 

16 determine recommendations for preventative life style patterns (which could be provided 

17 by professionals and/or determined statistically or confirmed/disproven in order to 

1 8 enhance potential strength and avoid or remedy potential or existing weaknesses (in as 

19 much as the pseudonymous statistical database would, of course, also contain the results 

20 of certain lifestyle therapeutic or pharmacological treatment approaches). It is believed 

21 that in the future, genetic sequence data which could be stored in the pseudonymous user 

22 database could also provide valuable clues as to the complete medical, psychological and 

23 psycho-social make-up of individuals. Another potential methodology could be to 

24 present particular emotion-evoking stimuli as a textual, verbal (or most ideally) 

25 multimedia videographic or even emersive virtual reality) presentation, capture details 

26 about the user's emotional response, comments, gestures, e.g., crying, laughing repulsion, 

27 mortification feeding of physical illness, etc., and perhaps even physiological feedback, if 

28 accessible.. With such large quantities of such sensitive data regarding individuals, SDI 

29 offers users a very powerful and compelling solution by which user's SDI agent can for 

30 any given application or circumstance select only those potions of his/her profile which 

31 will ultimately benefit the user. I.e., there are typically for most relevant applications, 
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1 e.g., employee hiring by corporate HR departments, health insurances, etc., extremely 

2 crude metrics which are used in the determination and/or terms of a prospective contract 

3 between such commercial entities and a user. There is, thus, considerable opportunity for 

4 SDI agents of the user to analyze user profile data in light of pre-determined models of 

5 what features and metrics are most significant in determining selection and/or terms of 

6 service or a professional relationship with that user based on historical data (through 

7 feedback) or publicly available informational models (e. g., as in the highly regulated 

8 health insurance industry). In order to initialize the statistical model for the more 

9 subjective example of employee selection and terms, the present system could , for 

10 example, disclose to the employer "Mr..X has a psychological profile feature which is 

1 1 highly correlated with a high aptitude for the present job which s/he is applying for and 

12 past job experience Y is 50% more likely than the average employee for that position to 

1 3 lead to a subsequent promotion and several "employee of the month awards" 
14 

1 5 Application to Identifying Similar Individuals Behaving Under Similar Circumstance 
16 

17 The present user profile information could also be extremely relevant for a novel 

1 8 Internet based service by which individuals could pseudonymously release their profiles 

19 and/or define specific situations and/or conditions (which could be actual, hypothetical or 

20 combination thereof) and the system will identify other scenarios (of like pseudonymous 

21 individuals and/or circumstances) to the general scenario and emphasizing and 

22 prioritizing certain particular aspects thereof. Examples include identifying individuals 

23 who had previously (or are presently) in the process of making a very similar professional 

24 decision, interpersonal decision (such as regarding a present or a potential spouse). 

25 Ideally, such interpersonal decisions would involve a very similar situation (such as a 

26 decision), a very similar individual in the analogous position of the user (and ideally, if 

27 possible other similar party(s) fulfilling the analogous role(s) as the present situation 

28 which confronts the user. The relative importance (priority) of the degree of similarity of 

29 various analogous roles in an analogous situation is determined in part by the type of 

30 information which the user would like to determine. For example, determining what a 

31 particular individual would do in a similar circumstance may depend upon how 



Page 364 of 370 



SUBSTITUTE SPECIFICATION (CLEAN COPY) 



1 dependent that user's action .or decision is upon the circumstances and conditions versus 

2 another individual(s) (and which individual(s) and/or the two (or more) individuals 

3 involved to which the context of the situation primarily relates. Of course, the nature of 

4 the action in question may predicate the relative importance of one or more of the above 

5 factors. These determinations are often quite subjective in nature. Requestees may also 

6 (if the choice exists) be situations which are either of a historical nature (i.e., the situation 

7 had occurred in the past) or of a present nature. In some cases in the latter case, is 

8 preferable as the user may actively request, e.g., the party to which s/he directly relates or 

9 is potentially affected by to provide advice and/or feedback or to what s/he would do 

10 under a particular "what if scenario which could be very specific and relevant to the 

1 1 present circumstances of the user. In a novel variation of the method, the party to which 

12 the user relates is commissioned by the user to do or say something in order to receive a 

1 3 reaction or response. Depending upon how similar the situation actually is, the prompt 

14 could be very similar or partially relevant to the action contemplated by the user in 

15 his/her own analogous situation. Or the user may wish to collect feedback from 

16 analogous individuals in similar situations regarding several hypothetical actions 

1 7 (perhaps acted out among several different groups) resulting from decisions in order to 

18 enhance the user's information about the user(s) and the associated situation. 

19 Alternatively, historical cases could include requests for valuable advice (as the 

20 experiences and the associated consequences are more likely to have been lived out). 

21 Typically it is most valuable (if a number of similar circumstances exist ) to collect 

22 multiple reactions/responses with which to form a statistical basis for the information. 

23 The biggest problem with the present system is collection of relevant and sufficiently 

24 detailed data relating to the relevant circumstances which must be identified and 

25 accessed. Psychotherapeutic (clinical) databases would be one source, as would be 

26 spoken or typed synchronous or asynchronous communications between individual as 

27 well as (the emerging) ubiquitous computing environments in which users' actions in 

28 spoken communications could be potentially monitored persistently off-line as well as 

29 on-line (for which SDFs privacy-enhanced profiling architecture would be particularly 

30 appropriate). Certainly, if the circumstances affecting the user (requestor) are of a long- 

31 term nature, periodic informational updates could be provided to him/her and/or the 
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1 mutual sharing and advising of the similar experience could be the basis for a two-way 

2 exchange of information between the parties on a short or long-term basis. 
3 

4 AUTOMATED LEARNING OF USER BEHAVIOR AS PART OF AN ENHANCED 

5 MEETING SCHEDULER AND CORRESPONDENCE FILTER 
6 

7 The present section provides application level extensions to similar inference- 

8 based rules recommendation and generation functionality such as is described in 

9 conjunction with the Smart Home description , smart office as well as the learning based 

10 methods disclosed within the section entitled "Resolution credentials" in the access 

1 1 reachability introduction and profile disclosure between two or more parties as disclosed 
12 

13 In order to utilize LEIA as a user agent, which is integral and ubiquitous within 

14 the work environment, it is necessary to make its learning as seamless and invisible from 

1 5 the user's perspective as possible. 
16 

17 The next level in applying the capabilities of LEIA is in being able to implicitly 

18 learn the context of the user's present temporal activities as well as establish relative 

19 priorities of the activities that the user (and others who intend to interface with the user) 

20 are engaged in and from the user's previous behavior to similar conditions automatically 

21 generate appropriateness functions (or rules) to automate the process of handling meeting 

22 schedules and filtering real time requests for correspondence with the user. It is possible 

23 to infer: 
24 

25 1 . The content profiles which are associated with the user's present activities. 
26 

27 2. (Often obviously) The nature of the user's present activities by observing what sorts of 

28 actions the user is engaged in, in the office. 
29 
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1 These content profiles and activity indicators may be passively observed by the 

2 user's present interactions (and timing thereof) with his/her PC, other smart appliances 

3 and data and voice communications which the user engages in (telephonically or from 

4 standard face to face dialogues) e.g., suggesting when the user is reading the morning 

5 news, checking phone messages, responding to e-mail, engaged in a particular project, 

6 etc. If these clues are not observable (or not positively identifiable) cyclical time 

7 dependent patterns may be used as implicit indicators of the user's present activities (see 

8 the above section entitled "Further Applications of LEIA"). Alternatively, the system 

9 could prompt a request of confirmation of the user's present activities and possibly on 

10 clues such as verbal regarding present and future established or changing activities and 

1 1 plans These same methods for utilizing time pattern elements which are used to predict 

12 the location of the user may be readily extended to predict or help to substantiate other 

13 statistical user variables relating to the types of actions and content related profiles 

14 characterizing the user's temporal behavior patterns. The following application of these 

1 5 techniques is used to enhance the automatic meeting scheduler: 
16 

1 7 The identity of the task or request, the activity and content attributes of the task or request 

18 profiles (target object profiles) may be developed utilizing the priority of that task 

19 relative to the user. This priority is estimated from the priority of the task/request (and its 

20 associated attributes) over other tasks/requests by the user. In order to better substantiate 

21 the relative importance to the user as inferred regarding these activities, the system 

22 should also display the scalar ratings which the user could adjust manually. 
23 

24 Nonetheless, there may be a degree of uncertainty in the user's intentions and 

25 LEIA's inference of the user's intentions. Moreover, unanticipated changeability by the 

26 user (uncertainty) may occur perhaps with increased frequency under certain variables 

27 like the identity of the requestor, the time of day, context of the users other present 

28 activities or when other meetings and obligations are pending which possess a relatively 

29 greater priority with the user (i.e., if they often occur spontaneously). In order to take this 

30 uncertainty factor into consideration, an overall statistical estimate may then be 

3 1 calculated taking into account the predicted statistical probability of each party ultimately 
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1 being available for the meeting. This serves the purpose of both refining the automatic 

2 scheduling process and informing would be visitors before hand as to the relative 

3 importance that their prospective meeting is (or increasingly becomes as the meeting time 

4 approaches) to the employee as well as the overall estimated degree of certainty of the 

5 meeting. This feature my be particularly useful for impromptu visitors where 

6 considerable uncertainty exists as to whether and to what degree the visitor is imposing 

7 upon the employee's schedule. 
8 

9 The temporal context of user behavior (using temporal time series analysis in 

10 conjunction with use of content analysis and user behavior assessment may be both 

11 important predictors as to whether a prospective meeting of a disclosed purpose is 

12 worthwhile for the employee based on present/future priorities and at which most likely 

13 times. By far, its most valuable application , however is in its ability to perform content 

14 analysis and user behavior assessment in real time in order to enhance LEIA's ability to 

1 5 determine whether and to what degree a would-be unannounced visitor or telephone or 

16 intercom callers unannounced contact is relevant to the employee 1 ' present activities and 

17 if not to coordinate with the visitor's agent the time or times when it would be most 

18 relevant to the employees anticipated activities for example there are certain activities 

19 during which an employee doesn't wish to be usually disturbed, e.g. while reading the 

20 morning news, taking a coffee break, calling home, responding to an important email 

21 message, ten minutes before an important meeting. Often within an organization 

22 individuals will attempt to speak with one another as such , the intended duration affects 

23 priority as does the identity of the individual in the organization. Often this variable can 

24 be automatically predicted by LEIA . In such a situation, the relative priority of the 

25 user's preexisting priority tasks is weighted against the priority of the requested tasks 

26 prior to notification. The above application can be readily extended as well to phone 

27 calls, in which the user could be automatically prompted by LEIA to disclose his/her 

28 purpose of the call and/or his/her identity if necessary. This application would be a 

29 useful extension to the telephony variation of the email filter described in issued patent 

30 entitled "System and Method for Electronic Identification of Desirable Objects in which 

3 1 the above additional inputs about employees and visitors may provide additional useful 
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1 feedback in automatically determining the rules dictating how to best handle the call or 

2 forward it to a more appropriate employee on behalf of the employee. If there is a certain 

3 degree of uncertainty in the agent's priority determination or assessment of the task 

4 affected the most appropriate rule the employee may observe the rule just prior to 

5 automatic implementation, or the action which it represents. The present value of the 

6 contextual employee activity assessment method as disclosed, if extended to advertising 

7 could be significant. For example the knowledge of the context of the user's present 

8 activities could determine what type and when an advertisement targeted to an employee, 

9 e.g. a decision maker will most likely elicit the best response (where the advertisement 

10 matches the professional interest profile) is an ad which corresponds with particular 

1 1 activities of the user ( e.g., which relate in some way with the ad/promotion. As 

12 discussed direct voice telephony and email dialogues may provide very useful clues 

13 (using natural language techniques) as to the particular profile interest summary which 

14 the user currently is experiencing at that moment both what types of user interactions and 

15 advertising is like to be most appropriate. Patent entitled System and Method for 

16 Customization Electronic Identification of Desirable Objects" suggests a variation of 

1 7 virtual community creation in which the use of present dialogue of users is provided as a 

1 8 means for determining whether or and if so which ongoing meeting dialogues within the 

19 organization would be most relevant for the user to be introduced to) either physically or 

20 virtually.. For example, virtual work groups (E. G. over industry wide extranets could 

21 use the above profiling approach in the creation of the group (by matching the 

22 employees' professional interest summaries with the project profile as well as in the 

23 dynamic ad hoc introductions of other experts who have a professional interest profile 

24 which matches the projects attributes such as human skill sets and proficiencies. There 

25 are also personality traits of users which may predispose a team towards success if an 

26 appropriate complementarity of these traits is exhibited within certain members of the 

27 team, e. g. the designers and engineers may be compliant, methodical and meticulous 

28 while the group leader may be impulsive results driven and flamboyant. Some members 

29 may be analytical and idea oriented, others more concerned with the end result and 

30 implementation strategy. Issued patent entitled "System and method for customized 

31 electronic identification of desirable objects" describes how queries may be routed to 
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1 knowledge domain experts in an organization environment .. LEIA may also identify and 

2 prioritize individuals who may be useful candidates to join into a meeting or address a 

3 specific request who are physically in proximity to the meeting location. 
4 

5 While the invention has been particularly shown and described with reference to a 

6 preferred embodiment, it will be understood by those skilled in the art that various changes 

7 in form and detail may be made therein without departing from the spirit and scope of the 

8 invention. 
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